Background of the Invention
This application is a continuation-in-part of Dryja et
al., U.S. S.N. 146,525, filed January 21, 1988, which is a
continuation-in-part of Dryja et al., U.S. S.N. 895,163, filed
August 11, 1986, both hereby incorporated by reference. 

This invention concerns the retinoblastoma gene and 
methods for detecting and treating patients afflicted with a 
defective retinoblastoma gene. 

Retinoblastoma is a neoplastic condition of the 
retinal cells, observed almost exclusively in children between 
the ages of 0 and 4 years. It affects between 1 in 34,000 and 
1 in 15,000 live births in the United States (L.E. Zimmerman,
1985, Retinoblastoma and retinocytoma, in W.H. Spencer (ed.),
Ophthalmic Pathology: An Atlas and Textbook, Vol. II,
Philadelphia: W.B. Saunders Co., pp. 1292-1351). If
untreated, the malignant neoplastic retinal cells in the
intraocular tumor travel to other parts of the body, forming
foci of uncontrolled growth which are always fatal. The
current treatment for a retinoblastoma is enucleation of the 
affected eye if the intraocular tumor is large; for small 
intraocular tumors, radiation therapy, laser therapy, or 
cryotherapy is preferred. There is no known successful 
treatment for metastatic retinoblastoma. As with most cancers, 
morbidity and mortality are reduced if diagnosis can be made
early in the course of the disease.

In 30-40% of cases of retinoblastoma, the affected
individual carries a heritable predisposition to retinoblastoma
and can transmit this predisposition to his or her offspring as
a dominant trait (A.G. Knudson, 1971, Mutation and cancer:
Statistical study of retinoblastoma, Proc. Natl. Acad. Sci.,
Vol. 68, pp. 820-23). Carriers of this
retinoblastoma-predisposing trait are at a greatly elevated
risk for development of several other forms of primary cancer, 
notably osteosarcoma and soft-tissue sarcoma. 

The genetic locus associated with familial 
retinoblastoma has been assigned to the q14 band of human
chromosome 13 (R.S. Sparkes et al., 1980, Science, Vol. 208,
pp. 1042-44). Most retinoblastomas arise from cells which have
lost both normal, dominant, homologous alleles at this
retinoblastoma locus. However, individuals carrying one 
defective allele may be predisposed to the disease. Children 
who have had one eye affected by retinoblastoma or who are 
related to someone with retinoblastoma may be genetically 
predisposed and therefore at risk of developing the disease. 
These individuals routinely are tested for retinoblastoma every 
2-3 months by an ocular examination procedure which requires 
placing the child under general anesthesia. 

Summary of the Invention 
In general, the invention concerns purified nucleic 
acid (less than 100 kb in size), and fragments thereof of at
least 15 bases, encoding the Rb gene. The invention also
concerns cells transformed with this nucleic acid, isolated 
polypeptides encoded by this nucleic acid, and antibodies to 
this polypeptide, or to naturally occurring retinoblastoma 
polypeptide. Retinoblastoma polypeptide is the polypeptide 
encoded by the Rb gene. Further, the invention concerns a 
composition, suitable for treating a human having a defective 
Rb gene, containing retinoblastoma polypeptide, or a fragment
thereof, in a pharmacologically acceptable carrier. 

The invention also features methods of screening human 
patients to determine those not at risk of developing 
retinoblastoma and thus not requiring conventional examinations 
to be performed. This screening involves, for example, 
comparing nucleic acid of a patient with purified nucleic acid 
encoding a human Rb gene, or fragments thereof. 

Thus, in various aspects, the invention features
methods of analyzing the predisposition of patients to
retinoblastoma which involve detecting large and small
deletions or point mutations in the retinoblastoma gene, or
detecting the co-inheritance of such defects with specific
restriction fragment length polymorphisms (RFLPs), or detecting
the presence or absence of a normal or defective retinoblastoma
gene by hybridizing a nucleic acid sample from the patient with
a probe specific for the retinoblastoma gene, and determining
the ability of the probe to hybridize to the nucleic acid. The
lack of hybridization to the nucleic acid indicates the
presence of a large deletion in the gene. A probe specific for 
the retinoblastoma gene may be hybridized to fragments 
separated by a defined physical property from a sample of a 
patient, the hybrids of the probe and the fragments detected, 
and the hybrids compared to hybrids detected from the 
hybridization of the probe and separated nucleic acid fragments 
from a normal retinoblastoma gene. The absence of hybrids or 
presence of hybrids of a smaller size compared to a normal 
patient is an indication of large deletions in the 
retinoblastoma gene of the patient. Preferably, the probe 
specific for the retinoblastoma gene is the cloned DNA in 
p4.7R, or a fragment thereof; and the defined physical property 
is molecular weight.

Small deletions or point mutations can be detected by 
determining the nucleotide sequence of a retinoblastoma allele 
from a patient, and comparing the nucleotide sequence with the 
nucleotide sequence of a retinoblastoma allele, or subregion
thereof, from a person not afflicted with retinoblastoma; or by 
detecting mismatches between a nucleic acid sample from a 
patient and a probe specific for the retinoblastoma gene from a 
person not afflicted with retinoblastoma. The co-inheritance 
of specific genetic polymorphisms with the retinoblastoma gene 
may be an indication of the predisposition of a patient to 
retinoblastoma. According to this method, nucleic acid
fragments are generated from a sample of the patient, the
fragments are separated according to a defined physical 
property of the fragments (e.g., molecular weight), a 
detectable probe specific for the retinoblastoma gene is 
hybridized to the fragments, hybrids of the probe and the 
fragments are detected, and the hybrids are compared to hybrids 
detected from the hybridization of the same probe and separated 
nucleic acid fragments from a sample of a parent of the patient.

In another aspect, the invention features the use of 
an isolated normal human retinoblastoma gene to synthesize Rb 
polypeptide for use in the treatment of individuals determined 
to have a defective Rb allele. 

In yet another aspect, the invention features a method 
of detecting the presence of the retinoblastoma polypeptide in 
a tumor sample from a human patient, by producing an antibody 
to the retinoblastoma polypeptide, contacting the antibody with 
the tumor sample, and detecting immune complexes as an 
indication of the presence in the tumor sample of the 
retinoblastoma polypeptide. The absence of the polypeptide 
indicates that the tumor is caused by a defect in a 
retinoblastoma allele. This procedure would preferably involve 
contacting a tumor sample from a human patient with an antibody
(e.g., a monoclonal antibody) which specifically reacts with the
retinoblastoma polypeptide, or a fragment thereof, and
determining whether the antibody binds to cells of the tissue
specimens. The absence of immune complexes is an indication
that the tumor was the result of a defective retinoblastoma 
allele. 

Other features and advantages of the invention will be 
apparent from the following description of the preferred 
embodiments thereof, and from the claims. 

Description of the Preferred Embodiments 

The description of the preferred embodiments of the 
invention will follow the brief description of the drawings 
given below. 
Drawings 

Fig. 1 is a diagrammatic representation of a 
restriction map of the insert in the clone p4.7R. 

Fig. 2 is a diagrammatic representation of a
restriction map of the genomic locus of the retinoblastoma gene.

Fig. 3 is a diagrammatic representation of the vectors 
p2AR3.8 and p2AR0.9 of the invention. 

Fig. 4 is a scale map of the normal retinoblastoma
gene.

Fig. 5 (5-1 through 5-3) is a nucleic acid sequence of
a cDNA of the normal retinoblastoma gene, with flanking regions.

Fig. 6 (6-1 through 6-9) is a nucleic acid sequence of 
exons of the normal retinoblastoma gene, with flanking regions. 

Fig. 7 is a restriction map of the retinoblastoma 
gene, showing the locations of DSPs.

Fig. 8 is a gel and a diagram showing the inheritance
of the polymorphism RB1.3 in a retinoblastoma-prone family.

Fig. 9 is a diagram showing the segregation of the DSP
RB1.3 in three families with hereditary retinoblastoma.
Retinoblastoma Polypeptide 

The Rb polypeptide is the specific amino acid chain 
encoded by the nucleic acid sequence of the normal 
retinoblastoma gene. The Rb polypeptide of this invention 
includes: (1) naturally occurring retinoblastoma protein; (2) 
synthetically produced retinoblastoma polypeptide; and (3)
retinoblastoma polypeptide produced from purified nucleic acid
(e.g., cDNA or genomic DNA) via an in vitro expression system.
Also included are biologically active fragments of Rb
polypeptide which either have a biological activity of
naturally occurring Rb polypeptide, or include an epitope of
this polypeptide and thus are suitable for production of
Rb-specific antibodies.
Retinoblastoma Gene

The Rb gene is that distinct nucleic acid sequence in 
the human genome, the absence or mutation of which predisposes 
one to retinoblastoma. The purified nucleic acid sequence 
encoding the retinoblastoma gene can be carried on vectors 
which can be propagated in cells. For the purposes of this 
invention, purified nucleic acid encoding the Rb gene is 
defined as nucleic acid isolated from its natural environment
(e.g., cDNA or a fragment of genomic DNA) which hybridizes
specifically to the retinoblastoma gene under hybridizing
conditions. An example of purified nucleic acid which encodes
the retinoblastoma gene, and is carried on a vector, is the
cDNA clone p4.7R. This clone was obtained in the following
manner.

cDNA

The human DNA probe pH3-8, isolated from a human 
chromosome 13 lambda phage library (M. Lalande et al., 1984, 
Cancer Genet. Cytogenet., Vol. 13, pp. 283-95), was used in a
chromosome walking technique to isolate and map 30 kilobases
(kb) of genomic DNA surrounding the H3-8 sequence. One
fragment generated by this technique, named p7H30.7R, was found
to recognize a DNA sequence in the mouse genome as well as one
within human chromosome 13 (T.P. Dryja et al., 1986, Proc.
Natl. Acad. Sci. USA, Vol. 83, pp. 7391-94). The homology of
p7H30.7R to both human and mouse DNA suggested that p7H30.7R 
contains coding sequences of a structural gene. 

To test this possibility, p7H30.7R was radiolabeled 
and used to probe a Northern blot of RNA isolated from three 
retinoblastoma tumors and an adenovirus 12-transformed human
embryonic retinal cell line (Vaessen et al., 1986, EMBO
Journal, Vol. 5, pp. 335-). The p7H30.7R probe hybridized to
an RNA transcript of approximately 4.7 kb from the retinal cell
line, but did not hybridize to any RNA transcripts from the
three tumor samples.

Subsequently, RNA isolated from the adenovirus- 
transformed retinal cell line was used to construct a cDNA 
library. This library was screened with the labeled p7H30.7R 
probe. Several cDNA clones were isolated which had similar 
restriction maps. The longest of these, p4.7R, contained 4.7 
kb of DNA. The restriction map of the insert in the clone 
p4.7R is shown in Fig. 1.

The p4.7R clone was used to screen RNA transcripts 
isolated from four retinoblastomas, an osteosarcoma, and the 
adenovirus-transformed retinal cells. In a Northern blot
analysis of isolated RNAs, the p4.7R probe hybridized to
a 4.7 kb transcript in the transformed retinal cells which was 
not present in the four retinoblastoma and one osteosarcoma 
cell samples. 

Genomic DNA 

Clones containing genomic DNA including the 
retinoblastoma gene were isolated in the manner described
below. Recombinant bacteriophage libraries containing human
genomic DNA fragments inserted in the lambda phage cloning
vector EMBL-3 were constructed according to published methods
(Seed et al., 1982, Gene, Vol. 19, pp. 201-209). Those
recombinant bacteriophage which contain fragments of the
retinoblastoma gene were initially detected by hybridization of
the bacteriophage plaques with p4.7R.



Thirty-six distinct recombinant bacteriophage that 
contain overlapping human genomic DNA fragments were isolated. 
Selected bacteriophage were plaque-purified and amplified, and 
the restriction map of each phage insert was determined by the 
method of Rackwitz et al., Gene, Vol. 30, pp. 195-200.

With these bacteriophage a restriction map of a region 
that spans approximately 200 kb was constructed, shown in Figs. 
2 and 4. All of the known sequences present in the mRNA from 
the retinoblastoma gene are present in this cloned region. In 
aggregate, the human DNA sequences in this set of bacteriophage
represent the chromosomal segment of the Rb gene. In Fig. 2,
the vertical marks above the map represent the location of
HindIII sites, and the vertical marks below the map represent
the location of EcoRI sites. The boxed areas represent HindIII
fragments which contain sequences found in the cDNA (exons).
Each double-headed arrow beneath the map represents a distinct
recombinant bacteriophage clone. Fig. 4 shows all the
recognition sites of the six restriction enzymes HindIII, EcoRI,
XbaI, SacI, SacII, and BamHI (New England Biolabs,
Inc.). Restriction endonuclease fragments that contained exons
were identified by their hybridization with cDNA clones or
synthesized oligonucleotide sequences based on cDNA sequence.
These restriction fragments were subcloned in the plasmid
vector Bluescribe (Stratagene, San Diego, CA). A total of 24
distinct plasmid subclones were generated in this manner.

The number and size of each exon was determined by
iterations of the following procedure. First, an 
oligonucleotide was synthesized that corresponded to the first 
20 nucleotides of the cDNA sequence. Using this 
oligonucleotide as a primer, the plasmid with a genomic insert 
containing the most 5' exon was sequenced. The resultant
sequence was aligned with the cDNA sequence to determine the 
length of the first exon; the point at which the plasmid and 
cDNA sequences diverged marked the beginning of the first 
intron. This exon and the flanking regions were further 
sequenced using synthetic oligonucleotide primers to generate a 
continuous nucleotide sequence composed of 5' promoter
sequence, exon 1, and the beginning of intron 1. The second 
and subsequent exons were defined by synthesizing sequencing 
primers corresponding to the next 20 nucleotides of cDNA 
sequence that had not been previously assigned to an exon. All
exons and the immediately adjacent flanking intron sequences 
were sequenced in both sense and antisense directions. 
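
As an illustration only, the divergence step described above can be sketched in Python. The function name and the toy sequences are hypothetical and are not taken from the Rb gene; the sketch assumes that the genomic read begins at the same base as the cDNA segment with which it is compared.

```python
def exon_end_by_divergence(genomic, cdna, cdna_start=0):
    """Count the bases that match between a genomic read and the cDNA,
    starting at cdna_start; the first mismatch marks the start of the intron."""
    matched = 0
    for g, c in zip(genomic.upper(), cdna[cdna_start:].upper()):
        if g != c:
            break
        matched += 1
    return matched


# Hypothetical toy sequences (not Rb sequence).
cdna = "ATGCCGAAGACCTTGGA"        # exon 1 (ATGCCGAAG) spliced to exon 2
genomic = "ATGCCGAAGGTAAGTCCTT"   # exon 1 followed by an intron beginning GT
exon1_len = exon_end_by_divergence(genomic, cdna)
```

Because an intron can by chance begin with a base that also begins the next exon, a boundary found in this way may be off by a base or two; checking for a GT dinucleotide at the proposed intron start is one way to refine it.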

The dideoxynucleotide chain termination method of 
sequencing was carried out using the enzyme Sequenase (United 
States Biochemical Corporation, Cleveland, Ohio) according to 
protocols supplied by the manufacturer. The intron region 
downstream of exon 20 could not be sequenced by this method, 
due to an unusually problematic repeated sequence in this 
region that caused a series of 45 stops (bands appeared in all
four lanes of the sequencing gel). To resolve this region,
sequencing reactions were carried out with Taq polymerase
(Perkin-Elmer/Cetus). This enzyme allowed for the
polymerization to be performed at 68°C and resolved the bases
in this region. 

All sequence data were analyzed and screened for 
overlapping regions using the sequence analysis program 
Microgenie Sequence Software (Beckman, Palo Alto, CA).

The position of each exon within the restriction map 
of the gene was determined by hybridization of cDNA fragments 
or synthetic oligomer sequences to recombinant bacteriophage 
DNA that had been digested with various restriction 
endonucleases. The precise location of most exons was 
subsequently deduced when recognition sequences of
endonucleases were identified within the intron-exon sequence 
and correlated with the map. The position of each of the 
remaining exons was arbitrarily placed in the center of the 
smallest restriction fragment to which it hybridized. 

The organization of the 27 exons along the genomic map 
of the retinoblastoma gene is illustrated in figure 4. This 
figure details the recognition sites for 6 restriction 
endonucleases and the position of the exons relative to these 
sites. Exons 1, 2, 3, 6, 9, 10, 13, 21, 22, 23, 24, 25, 26,
and 27 have been precisely localized on this map. The other
exons were mapped within small restriction fragments and are
illustrated in the middle of these fragments. Exons 11, 17,
and the cluster of exons 14-16 were mapped by this technique 
with uncertainties of not more than 2.0 kb. The remaining 
exons (exons 4, 5, 7, 8, 12, 18, 19, and 20) were all mapped to 
within 0.8 kb of their true locations. For reference, this map 
also shows the position of several naturally occurring 
restriction fragment length polymorphisms. 
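
The relation between the size of the restriction fragment and the mapping uncertainty can be sketched as follows. This is a minimal illustration, assuming that an exon placed at the center of a fragment can be misplaced by at most half the fragment length; the function name and the fragment coordinates are hypothetical.

```python
def place_exon(fragment_start_kb, fragment_end_kb):
    """Place an exon at the center of the smallest restriction fragment
    to which it hybridized; return (position_kb, max_error_kb)."""
    if fragment_end_kb <= fragment_start_kb:
        raise ValueError("fragment end must lie beyond fragment start")
    midpoint = (fragment_start_kb + fragment_end_kb) / 2
    max_error = (fragment_end_kb - fragment_start_kb) / 2
    return midpoint, max_error


# Hypothetical 1.6 kb fragment: the exon is drawn at its center, and its
# true position can differ from the drawn position by at most 0.8 kb.
position_kb, max_error_kb = place_exon(40.0, 41.6)
```

Under this assumption, an uncertainty of not more than 0.8 kb corresponds to a fragment of not more than 1.6 kb, and an uncertainty of not more than 2.0 kb to a fragment of not more than 4.0 kb.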

Fig. 6 (6-1 through 6-9) shows the sequence flanking 
and including each exon. The exons range in size from 31 
nucleotides (exon 24), to 1973 nucleotides (exon 27). The 
shortest intron sequence was found to be 80 nucleotides long 
and is located between exons 15 and 16, whereas the largest 
spans approximately 70.5 kb between exons 17 and 18. All of the
intron donor and acceptor splicing sites comply with the GT-AG
splice junction rule. Our methods of sequencing proved more 
accurate than previous reports in defining the exact number of 
exons comprising the retinoblastoma gene. 
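
The GT-AG rule referred to above can be checked mechanically once the intron boundaries are known. The following Python sketch is illustrative only; the function name and the two toy introns are hypothetical and are not Rb sequence.

```python
def follows_gt_ag_rule(intron):
    """Return True if the intron begins with GT (donor site)
    and ends with AG (acceptor site)."""
    intron = intron.upper()
    return len(intron) >= 4 and intron.startswith("GT") and intron.endswith("AG")


# Hypothetical toy introns.
introns = {
    "toy_intron_a": "GTAAGTTTCCTTTTCAG",  # GT ... AG
    "toy_intron_b": "GCAAGTTTCCTTTTCAG",  # GC ... AG, does not comply
}
results = {name: follows_gt_ag_rule(seq) for name, seq in introns.items()}
```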

The first exon and the region immediately 5' to this 
exon are very G-C rich, which is a characteristic of promoter 
regions. This region contains 9 possible HpaII restriction
sites, is composed of 66% C+G nucleotides and does not exhibit
CpG suppression. These criteria are indicative of an HTF
island. This promoter region contained some nucleotides that
could not be resolved in either the sense or antisense
direction using either Sequenase or Taq polymerase.
Presumably, this was due to secondary structure that forms in
this promoter region. 
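
The three criteria mentioned above (HpaII sites, C+G content, and absence of CpG suppression) can each be computed from a sequence. The Python sketch below is a hedged illustration: the observed/expected CpG ratio shown is a commonly used measure and is an assumption here, not necessarily the calculation applied to the Rb promoter, and the 20-base toy sequence is hypothetical.

```python
def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def cpg_observed_over_expected(seq):
    """Observed CpG count divided by the count expected from base
    composition alone: (CpG * N) / (C * G). Values well below 1
    indicate CpG suppression."""
    seq = seq.upper()
    c, g = seq.count("C"), seq.count("G")
    if c == 0 or g == 0:
        return 0.0
    return seq.count("CG") * len(seq) / (c * g)


def hpaii_site_count(seq):
    """Number of HpaII recognition sites (CCGG)."""
    return seq.upper().count("CCGG")


toy = "CCGGCGCGATCCGGTTACGC"  # hypothetical sequence, not Rb promoter
gc = gc_fraction(toy)
ratio = cpg_observed_over_expected(toy)
sites = hpaii_site_count(toy)
```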

Analysis of the sequence approximately 30 nucleotides 
upstream of the transcription initiation site defined by Lee et 
al. (1987b) does not reveal a TATA box that is found in other
promoter regions. This suggests that either the previously
published initiation site is not in fact correctly defined, or
that the retinoblastoma gene lacks the prototypical TATA and
CAAT boxes of promoter regions. Further analysis of the
sequence 5' to exon 1 reveals a possible TATA box at base pair
#-274, labeled base pair #122 in figure 6-1. Homology for the
seven base region is only 57%, yet the first four bases
T-A-T-A, which are the bases most frequently conserved, are
100% homologous. A possible capping site, CAC, is located 14
bases away and again 49 bases away. CAAT boxes were not
identified although the region is generally G-C rich. 
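
The 57% figure corresponds to four matching positions out of seven. The arithmetic can be sketched as below; both seven-base strings are placeholders chosen only so that the first four bases match, and neither is asserted to be the consensus actually used or the sequence actually found at this position.

```python
def percent_identity(a, b):
    """Percentage of positions at which two equal-length sequences agree."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


consensus = "TATAAAA"  # placeholder 7-base consensus (assumption)
candidate = "TATAGCG"  # hypothetical candidate: only T-A-T-A matches
identity = percent_identity(consensus, candidate)
core_identity = percent_identity(consensus[:4], candidate[:4])
```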

The intron sequence that flanks the 3' side of exon 20
consisted of 21 consecutive repeats of the sequence TTT(T)C
that together span 87 nucleotides. The number of repeat units 
can vary between different individuals and the alleles 
determined by the number of repeats behave like a heritable DNA 
polymorphism. 
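
Counting the repeat units in an allele, as would be needed to use this region as a polymorphism, can be sketched with a regular expression. The toy sequence below is hypothetical: it assumes 18 four-base units (TTTC) and 3 five-base units (TTTTC), which is one arrangement consistent with 21 units spanning 87 nucleotides, and the flanking bases are invented.

```python
import re

REPEAT_RUN = re.compile(r"(?:TTTT?C)+")   # a run of TTTC / TTTTC units
REPEAT_UNIT = re.compile(r"TTTT?C")       # a single unit


def longest_repeat_run(seq):
    """Return (number_of_units, length_in_nucleotides) of the longest
    uninterrupted run of TTT(T)C units in seq."""
    runs = [m.group(0) for m in REPEAT_RUN.finditer(seq.upper())]
    if not runs:
        return 0, 0
    best = max(runs, key=len)
    return len(REPEAT_UNIT.findall(best)), len(best)


# Hypothetical allele: invented flanks around 18 x TTTC + 3 x TTTTC.
allele = "ACGA" + "TTTC" * 18 + "TTTTC" * 3 + "GGA"
units, span = longest_repeat_run(allele)
```

Two alleles differing in the number of units would give different values of `units`, which is the property that lets the repeat behave as a heritable marker.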

A computer search of the sequence data identified 
several intron regions homologous to Alu repetitive sequences. 
Alu repeats were located in the following regions: (1)
downstream of exon 2, between bp 492 and bp 704 according to the
numbering scheme in Figure 6; (2) upstream of exon 9, between bp
19 and bp 117; (3) downstream of exon 11, between bp 504 and bp
680; (4) the intron sequence flanking both sides of exon 14,
between bp 132 and bp 270 and between bp 420 and bp 741; and
(5) upstream of exon 17, between bp 36 and bp 210. The Alu
sequence located downstream of exon 2 contains two internal 
sequences that are highly conserved in Alu repetitive 
sequences. The first is a sequence (GAGGCNGAGC) corresponding 
to the T-antigen binding sequence of the SV40 replication 
origin. The second is a symmetrical sequence (CCAGCCTGG) of no 
known function. This short symmetrical sequence is also 
present in both of the Alu sequences on either side of exon 
14. Exons 14 and 15 are separated by a short intron that is 
almost entirely composed of Alu sequence, suggesting that exons 
14 and 15 were possibly at one time a single exon and were
divided by the insertion of an Alu element during evolution.
This Alu sequence may have been directed to this position by
the other Alu sequence located on the 5' end of exon 14 because
retroposons have a tendency to integrate adjacent to one
another.

The 3' end of the retinoblastoma gene contains the 
usual polyadenylation signal sequence, AATAAA. One sequence 
(TGTGTTCT) located 32 bases downstream of this hexamer is 
equivalent to the conserved downstream consensus sequence
(YGTGTYY) described by McLauchlan et al. (1985). This sequence
and surrounding bases compose the "G/T cluster" generally found
in a region 30 nucleotides downstream of the polyadenylation
signal sequence (Birnstiel et al., 1985).
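
The correspondence between TGTGTTCT and the consensus YGTGTYY can be verified by expanding the ambiguity code Y (C or T) into a regular expression. The sketch below is illustrative; the helper name is hypothetical and only a few IUPAC codes are included.

```python
import re

# Subset of the IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "Y": "[CT]",    # pyrimidine
    "R": "[AG]",    # purine
    "N": "[ACGT]",  # any base
}


def consensus_to_regex(consensus):
    """Translate an IUPAC consensus string into a compiled regex."""
    return re.compile("".join(IUPAC[ch] for ch in consensus.upper()))


pattern = consensus_to_regex("YGTGTYY")
match = pattern.search("TGTGTTCT")
```

Here the first seven bases of the eight-base sequence satisfy the seven-base consensus.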

The p4.7R probe also was used to screen genomic DNA 
isolated from the tumors of 50 unrelated individuals (40 
retinoblastomas, 8 osteosarcomas, and 2 undifferentiated tumors
of unknown cellular origin arising in patients with hereditary
retinoblastoma), as described in more detail below. These DNA
samples were digested with HindIII and analyzed by Southern
blot hybridization using radiolabeled p4.7R as the probe. This
analysis revealed three types of deviant patterns of the
genomic DNA restriction fragments: totally absent fragments,
representing apparent homozygous deletions; under-represented
fragments, representing apparent heterozygous deletions; and
fragments of altered size, reflecting either partial deletion
or an alteration of a restriction site. At least 30% of the
tumor DNAs exhibited one of these abnormalities. In
comparison, Southern blot analysis of leucocyte DNA from 18
normal individuals showed a uniform pattern of restriction
fragments.
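
The three deviant patterns can be expressed as a simple classification of band data. The following Python sketch is a hypothetical illustration: the fragment sizes, the relative intensities, and the 0.75 cutoff are all invented for the example and are not measurements from the Rb locus.

```python
def classify_pattern(control_sizes_kb, tumor_bands, half_cutoff=0.75):
    """Classify each expected fragment as 'absent', 'under-represented'
    or 'normal', and list tumor bands of altered (unexpected) size.

    tumor_bands maps fragment size (kb) to intensity relative to the
    control (1.0 = normal two-copy signal)."""
    calls = {}
    for size in control_sizes_kb:
        intensity = tumor_bands.get(size, 0.0)
        if intensity == 0.0:
            calls[size] = "absent"             # apparent homozygous deletion
        elif intensity < half_cutoff:
            calls[size] = "under-represented"  # apparent heterozygous deletion
        else:
            calls[size] = "normal"
    altered = sorted(set(tumor_bands) - set(control_sizes_kb))
    return calls, altered


# Hypothetical control pattern and tumor pattern.
control = [9.8, 7.5, 6.2, 5.3]
tumor = {9.8: 1.0, 7.5: 0.5, 5.3: 1.0, 4.1: 0.5}
calls, altered = classify_pattern(control, tumor)
```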
Use 

The cDNA and genomic sequences, e.g., those in p4.7R, 
can be used, according to the invention, to screen individuals
for the presence of a mutated allele of the Rb gene. This
screening procedure will allow individuals having a risk of
developing retinoblastoma — because of family history or a 
previous incidence of retinoblastoma in one eye — to determine 
the need for routine testing by the current ocular examination 
procedure. Only if the screening procedure determines that the 
individual possesses a mutant Rb allele will the examination 
procedure need to be conducted on a regular basis. Those with 
two normal Rb alleles can discontinue examination, as the risk 
of developing retinoblastoma in an individual with two normal 
copies of the Rb gene is approximately 1 in 20,000, or 0.005%, 
compared to a risk of 80%-90% if an individual has an Rb allele 
containing a mutation sufficient to inactivate the allele. 
Thus, a substantial percentage of individuals who are currently 
examined regularly are not actually at a greater risk than the 
general population: neither a family history of nor a previous 
incidence of retinoblastoma is conclusive evidence that an 
individual has the genetic predisposition to the disease. 
Therefore, such individuals, actually carrying two normal 
copies of the Rb gene, have been repeatedly undergoing the 
expensive and traumatic ocular examination procedure needlessly. 
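
The risk figures quoted above can be checked with exact arithmetic. The sketch below uses only the numbers given in the text (1 in 20,000 and 80%-90%); the fold-increase values are derived from them for illustration and are not stated in the original.

```python
from fractions import Fraction

baseline_risk = Fraction(1, 20000)      # two normal Rb alleles
carrier_risk_low = Fraction(80, 100)    # one inactivated allele, lower bound
carrier_risk_high = Fraction(90, 100)   # one inactivated allele, upper bound

baseline_percent = float(baseline_risk * 100)
fold_low = carrier_risk_low / baseline_risk
fold_high = carrier_risk_high / baseline_risk
```

On these figures, carrying an inactivated allele raises the risk by a factor on the order of 16,000 to 18,000 relative to an individual with two normal alleles.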

The screening procedure according to the invention 
includes: (1) testing a nucleic acid sample of a patient for 
large deletions in the Rb gene locus; (2) testing a nucleic 
acid sample of a patient for small deletions or point mutations 
in the Rb gene locus; and (3) testing a nucleic acid sample of
a patient for RFLPs linked to the Rb gene locus.

Detection of Large Deletions in the Rb Gene 
The availability of DNA probes from the Rb gene 
provides a means of directly detecting genetic lesions that 
create retinoblastoma-predisposing alleles. Suitable probes 
include the entire normal retinoblastoma gene sequence, or 
fragments thereof consisting of 15 or more bases encoding a 
specific portion of the retinoblastoma gene. When performed by 
Southern blot and dot blot procedures, this analysis is 
generally limited to the study of those lesions that create 
gross structural changes in the Rb gene, such as deletion of 
many hundreds of base pairs. 

The DNA for a Southern blot or dot blot analysis is
isolated from peripheral leucocytes or, if the patient has had 
a tumor in one eye, from the tumor. To examine leucocyte DNA, 
a 10 ml blood sample is obtained from the individual, and the 
genomic DNA is isolated from the leucocytes in the sample, 
according to standard techniques. This DNA is digested with a 
restriction endonuclease (e.g., HindIII), and the resulting
fragments are separated on an agarose electrophoresis gel 
according to a physical property such as molecular shape or 
molecular weight. For the purposes of this invention, 
molecular shape is defined as the structural configuration of 
the molecule (e.g., linear, circular, double-stranded or 
single-stranded). The DNA in the gel is transferred to a
nitrocellulose filter by blotting. The filter is then probed
with, e.g., radiolabeled p2AR3.8 and, separately, p2AR0.9,
containing subfragments from p4.7R obtained by EcoRI
digestion. (The diagrams of the vectors p2AR3.8 and p2AR0.9
are shown in Figure 3.) In order to more precisely define the
location of any abnormalities detected, two or more subfragment 
probes are used separately rather than the entire p4.7R insert 
probe. The autoradiograms of the probed filter generate the 
data necessary to construct a restriction map of the Rb locus 
in the somatic or tumor DNA of the tested individual. 

This restriction map is compared with a control 
restriction map, determined by using the same restriction 
enzymes for digestion and the same probe. A suitable control 
is DNA obtained from an adenovirus-transformed retinal cell
line or leucocyte DNA from a set of normal individuals. If the
tested individual has an Rb allele containing a significantly
large deletion, a restriction map of his or her DNA, compared with
the control, will contain an additional band or bands, and/or a
band or bands that have lost 50% of their intensity, caused by
a change in the size, or total elimination, of one or more
restriction fragments by the deletion in one allele at the Rb
locus.
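
How a deletion within one allele changes the fragment pattern can be illustrated by digesting two toy alleles in silico. In the sketch below the sequences are hypothetical (runs of a single base separated by HindIII sites, AAGCTT, cut after the first A), and every fragment is assumed to be detected by the probe, which need not hold for a real blot.

```python
def digest(seq, site="AAGCTT", cut_offset=1):
    """Return the fragment lengths obtained by cutting a linear sequence
    at every occurrence of the recognition site (HindIII by default)."""
    seq = seq.upper()
    cuts = []
    i = seq.find(site)
    while i != -1:
        cuts.append(i + cut_offset)
        i = seq.find(site, i + 1)
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]


# Hypothetical normal allele with three HindIII sites.
normal = "G" * 30 + "AAGCTT" + "C" * 50 + "AAGCTT" + "G" * 40 + "AAGCTT" + "C" * 20
# Hypothetical mutant allele: 20 bases deleted between the first two sites.
mutant = normal[:50] + normal[70:]

normal_fragments = digest(normal)
mutant_fragments = digest(mutant)
new_bands = sorted(set(mutant_fragments) - set(normal_fragments))
reduced_bands = sorted(set(normal_fragments) - set(mutant_fragments))
```

In an individual heterozygous for such a deletion, the fragment listed in `new_bands` would appear as an additional band, and the fragment listed in `reduced_bands` would be contributed by only one allele and so would be expected at about half intensity.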

This screening procedure by Southern analysis will 
detect the existence of Rb alleles which have large deletions 
and are thereby non-functional. If this analysis indicates
that the tested DNA from an individual has a restriction map
which is different from the control map, there is a high
probability that the individual contains a non-functional,
mutant Rb allele. The individual must be monitored closely
thereafter for the development of retinoblastoma.

If the test restriction map appears identical to the 
control, a different screening procedure can be performed to 
determine if the individual possesses an Rb allele having a 
small deletion or point mutation. Small deletions and point
mutations may be sufficient to inactivate the allele, but not
prevent hybridization with a probe. An example of this
screening procedure is outlined below. 

Detection of Other Mutations in the Rb Gene 
To examine a DNA sample of an individual for small 
deletions or point mutations in the Rb locus, both homologs of 
the Rb gene from said individual are cloned. The cloned
alleles then can be tested for the presence of nucleic acid 
sequence differences from the normal allele, e.g., as 
represented by p4.7R, by one of the following two methods: (1) 
the nucleotide sequence of both the cloned alleles and p4.7R 
are determined and then compared, or (2) the RNA transcripts 
from p4.7R are hybridized to single stranded whole genomic DNA 
from an individual to be tested, and the resulting heteroduplex 
is treated with Ribonuclease A (RNase A) and run on a 
denaturing gel to detect the location of any mismatches. In
more detail, these methods can be carried out according to the
following procedure.

The alleles of the Rb gene in an individual to be 
tested are cloned using conventional techniques. A common 
method, for example, employs the bacteriophage vector EMBL3 
(Frischauf et al., 1983, J. Mol. Biol., Vol. 170, pp. 827-). A
10 ml blood sample is obtained from the individual. The
genomic DNA isolated from the cells in this sample is partially
digested with MboI to an average fragment size of approximately
20 kb. Fragments in the range from 18-21 kb are isolated. The
resulting MboI-ended fragments are ligated into the EMBL3
vector DNA which has been completely digested with BamHI,
treated with alkaline phosphatase, and heated to 68°C for 10
minutes to disrupt the cohesive ends. This ligation mix is
used in an in vitro lambda packaging reaction, and the packaged
phage are amplified by growing a plate stock. (This cloning
technique is described generally in Maniatis et al., 1982,
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor
Publications, pp. 256-293.) 
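
Two points in the cloning procedure above lend themselves to a short illustration: MboI-ended inserts can be ligated into a BamHI-cut vector because both enzymes leave the same 5' overhang, and only fragments of 18-21 kb are retained. In the Python sketch below the helper names and the list of fragment sizes are hypothetical; the recognition sites (MboI: GATC, cut before the G; BamHI: GGATCC, cut after the first G) are the standard ones.

```python
def five_prime_overhang(site, cut_offset):
    """Overhang left by an enzyme that cuts a palindromic site
    cut_offset bases from the 5' end on each strand."""
    return site[cut_offset:len(site) - cut_offset]


MBOI = ("GATC", 0)     # ^GATC
BAMHI = ("GGATCC", 1)  # G^GATCC

compatible = five_prime_overhang(*MBOI) == five_prime_overhang(*BAMHI)


def size_select(fragment_sizes_kb, low=18.0, high=21.0):
    """Keep only fragments within the size window (inclusive)."""
    return [s for s in fragment_sizes_kb if low <= s <= high]


# Hypothetical fragment sizes from a partial digest.
selected = size_select([12.4, 18.0, 19.7, 20.9, 21.0, 23.5])
```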

Approximately 5 x 10* plaque forming units (pfu) 
from this plate stock are then screened with radiolabeled p4.7R 
by hybridization and autoradiography. Plaques which show 
hybridization to the p4.7R probe are plaque-purified and 
rescreened according to the above procedure. Positive plaques 
from the rescreening are isolated and used to prepare DNA
putatively containing Rb alleles from the individual.

The MboI genomic inserts in these isolated EMBL3
vector DNA samples are tested for the location of the sequences
homologous to p4.7R by Southern analysis. DNA samples
containing the entire Rb gene region are selected, and the
appropriate restriction fragments containing the Rb gene from
these samples are subcloned into a suitable vector, such as
pUC9. These subclones thus contain copies of one or both Rb
alleles from the DNA of the individual to be tested. To
determine if both alleles are represented, the initial phage
isolates are tested for the existence of restriction
polymorphisms. These subcloned alleles are then examined for
differences from p4.7R by one of the following techniques. 

First, the nucleotide sequence of the normal Rb gene 
in p4.7R is determined. Restriction fragments of approximately 
500 base pairs (bp) from p4.7R are subcloned into an M13mp8 
phage vector and sequenced by the dideoxy technique (Sanger et 
al., 1977, Proc. Natl. Acad. Sci. USA, Vol. 74, pp. 5463-). A 
composite sequence of the Rb gene then can be assembled from 
these individual subclone sequences. The complete sequence of 
the normal retinoblastoma gene and flanking sequences is shown 
in Fig. 5 and Fig. 6. 

The isolated Rb gene alleles are sequenced according 
to the following procedure. Restriction fragments (~2 kb) of 
the allele are subcloned into the M13mp8 vector, and short 
stretches (~500 bp) are sequenced individually using small 
restriction fragments isolated from p4.7R as the primers in the 
dideoxy sequencing reactions. The composite nucleotide 
sequence of the isolated allele then can be constructed from 
these individually-primed sequences. This sequence is compared 
directly with the sequence of the normal Rb gene, determined 
from p4.7R, to reveal any deletions or point mutations in the 
isolated allele. 

An alternative method of comparing the allelic DNA 
with the normal Rb gene employs RNase A to assist in the 
detection of differences between the p4.7R sequence and the 
allele sequence. This comparison is performed in steps using 
small (~500 bp) restriction fragments of p4.7R as the probe. 
First, p4.7R is digested with a restriction enzyme(s) that cuts 
the Rb gene sequence into fragments of approximately 500 bp. 
These fragments are separated on an electrophoresis gel, 
purified from the gel and cloned individually, in both 
orientations, into an SP6 vector (e.g., pSP64 or pSP65; Melton 
et al., 1984, Nucleic Acids Res., Vol. 12, pp. 7035-). The 
SP6-based plasmids containing inserts of p4.7R fragments are 
transcribed in vitro using the SP6 transcription system, well 
known in the art, in the presence of [α-32P]GTP, generating 
radiolabeled RNA transcripts of both strands of the cDNA of the 
Rb gene. 

Individually, these RNA transcripts are used to form 
heteroduplexes with the allelic DNA, as described by Myers et 
al., 1985, Science, Vol. 230, pp. 1242-46, the teachings of 
which are incorporated herein by reference. Mismatches that 
occur in the RNA:DNA heteroduplex, owing to sequence 
differences between the p4.7R fragment and the Rb allele 
subclone from the individual, result in cleavage in the RNA 
strand when treated with RNase A. Such mismatches can be the 
result of point mutations or small deletions in the 
individual's Rb allele. Cleavage of the RNA strand yields two 
or more small RNA fragments, which run faster on the denaturing 
gel than the RNA probe itself. 
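
The expected gel pattern can be reasoned about with a simple model in which every mismatch between probe and allele is a cleavage point and the probe is split into the pieces between consecutive cleavage points. The Python sketch below makes that arithmetic explicit. It is an idealization: it assumes the two sequences are already aligned and of equal length, that the allele is written in the same orientation as the probe, that every mismatch is cleaved (which is not guaranteed in practice), and that the cut falls immediately 3' of the mismatched base. The function name and the example sequences are hypothetical.

```python
def normalize(seq: str) -> str:
    """Upper-case a sequence and treat RNA 'U' as equivalent to DNA 'T'."""
    return seq.upper().replace("U", "T")

def rnase_a_fragments(probe: str, allele: str) -> list:
    """Lengths of probe fragments expected if the probe is cut just 3' of
    every position where it differs from the allele (idealized model)."""
    if len(probe) != len(allele):
        raise ValueError("sequences must be aligned and of equal length")
    p, a = normalize(probe), normalize(allele)
    cut_sites = [i for i, (x, y) in enumerate(zip(p, a)) if x != y]
    fragments, start = [], 0
    for site in cut_sites:
        fragments.append(site + 1 - start)  # piece ending at the mismatched base
        start = site + 1
    if start < len(p):
        fragments.append(len(p) - start)    # remaining 3' piece
    return fragments

probe = "ACGUACGUACGUACGUACGU"   # hypothetical 20-base RNA probe
normal = "ACGTACGTACGTACGTACGT"  # identical to the probe: no cleavage
mutant = "ACGTACGTACATACGTACGT"  # single base change at index 10
print(rnase_a_fragments(probe, normal))  # [20]
print(rnase_a_fragments(probe, mutant))  # [11, 9]
```

A full-length band alone corresponds to no detected mismatch, while two or more shorter bands point to a sequence difference within the probed fragment.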

In the RNase A technique, radiolabeled Rb gene RNA is 
hybridized to single strands of an individual's Rb alleles 
which have been cloned into a vector. The RNase A technique is 
advantageous, however, because it also can be used without 
having to clone the Rb alleles. Preferably, genomic DNA is 
isolated from blood cells of the individual to be tested, and 
this genomic DNA is hybridized directly with the radiolabeled 
Rb RNA probes to determine sequence differences from the normal 
Rb gene. Specifically, 5 µg of isolated, total genomic DNA 
is resuspended with the labeled RNA probe in 30 µl of 
hybridization buffer (80% formamide, 40 mM Pipes pH 6.4, 0.4 M 
NaCl, and 1 mM EDTA), and this hybridization mix is treated at 
90°C for 10 minutes to denature the DNA. The mixture then is 
cooled slowly to 45°C and incubated at this temperature for 10 
hours to allow hybridization of the RNA probe to the 
single-stranded DNA copies of the Rb allele. After 
hybridization, RNase A treatment and electrophoresis are 
performed as described by Myers et al., supra. Mismatches 
between the RNA probe and the genomic copies of the 
individual's Rb alleles are then readily detected. 

Detection of RFLPs Linked to the Rb Gene 
The inheritance of a retinoblastoma-predisposing 
defect can be traced by following its co-inheritance with DNA 
polymorphisms in a pedigree analysis. 

The gene map shown in Figure 2 was used to develop 
nucleic acid probes useful for retinoblastoma diagnosis. To do 
so, the bacteriophage DNA corresponding to the human inserts 
was subcloned in the plasmid vector "Bluescribe" 
(Stratagene). Fifteen single-copy DNA fragments from the gene, 
ranging in size from 500 bp to 2000 bp, were subcloned. These 
sequences are scattered over the 200 kb of the mapped region. 
Subcloned DNA fragments were separated from vector sequences by 
digestion of plasmid DNA with one or more restriction 
endonucleases, electrophoresis through a 0.6% low-melting-point 
agarose gel, and purification by chromatography using an 
Elutip-d column (Schleicher and Schuell). Purified DNA 
fragments were radiolabeled with 32P-dCTP (New England 
Nuclear) by the random primer technique using the Klenow 
fragment of DNA polymerase I. 

Restriction fragment length polymorphisms (RFLPs) 
were discovered by digesting genomic DNA isolated from six 
normal individuals with 33 different restriction enzymes. The 
DNA fragments resulting from the 198 separate digests were 
separated on a 0.8% agarose electrophoresis gel according to 
molecular shape or molecular weight. The DNA was transferred 
to nitrocellulose filters and hybridized with single copy DNA 
probes purified from the retinoblastoma gene according to 
published methods (T.P. Dryja et al., 1986, Proc. Natl. Acad. 
Sci. USA, Vol. 83, pp. 7391-94). 

Of the 15 single-copy probe fragments, only five 
reveal RFLPs. Four of the polymorphisms appear to be the 
result of minor alterations (perhaps single base changes) in 
the recognition sequence of a restriction endonuclease (KpnI, 
XbaI, MboII, or Tth111I). The fifth polymorphism reflects 
variability in the number of tandem repeats of a 50 base pair 
sequence. The locations of the DNA polymorphisms are shown in 
the map in Figure 2 (vertical arrows above the map). The 
location of the polymorphic MboII site(s) has not been 
determined precisely but is at approximately 175 kb on 
this map. The frequencies of alleles which correspond to 
particular DNA polymorphisms are indicated in Table 1. 

In order to demonstrate the utility of these probes to 
detect the presence of retinoblastoma-predisposing alleles in 
humans, twenty pedigrees with hereditary retinoblastoma were 
analyzed. DNA was extracted from leucocyte nuclei of venous 
blood from available family members according to the method of 
Kunkel et al., 1977, Proc. Natl. Acad. Sci. USA, Vol. 74, pp. 
1245-49, hereby incorporated by reference. For analysis of a 
kindred with a given RFLP, DNA from the available family 
members was digested with the appropriate restriction 
endonuclease. The resulting fragments were separated by 
agarose-gel electrophoresis, transferred to nitrocellulose 
filters, and hybridized to labeled probe. 

In these families, the inheritance of alleles 
determined by the DNA polymorphisms within the retinoblastoma 
gene was traced and compared to the inheritance of the 
retinoblastoma-predisposing trait. For example, consider the 
polymorphism detected by probe p68RS2.0 (see Table 1). When 
genomic DNA is digested with the restriction enzyme RsaI, this 
probe hybridizes to allelic DNA fragments of different 
lengths. The size of these fragments ranges from 1.5 kb to 2.0 
kb with intervals of approximately 50 bp. The DNA sequence of 
the 2.0 kb genomic fragment cloned in p68RS2.0 has a 50 to 53 
bp segment which is repeated approximately 30 times (Table 2). 
This 53 bp segment can be used as a probe in these analyses. A 
portion of the repeated sequence has homology to core sequences 
of VNTRs (Variable Number of Tandem Repeats) reported elsewhere 
(Y. Nakamura et al., 1987, Science, Vol. 235, pp. 1616-22). 

TABLE 1 

DNA POLYMORPHISMS IDENTIFIED WITHIN THE RETINOBLASTOMA GENE 

DNA            RESTRICTION     MAP              ALLELES 
PROBE          ENDONUCLEASE    LOCATION (1)     SIZE (kb) (3)   FREQUENCY (4) 

p68RS2.0       RsaI            142-143 kb       2.00            0.13 
                                                1.95            0.02 
                                                1.90            0.07 
                                                1.85            0.07 
                                                1.80            0.35 
                                                1.75            0.20 
                                                1.65            0.09 
                                                1.50            0.09 

p88PR0.6 (5)   XbaI            120 kb           7.0             0.55 
                                                5.5             0.45 

p35R0.6        Tth111I         195 kb           4.95            0.20 
                                                4.35            0.80 

p2P0.3         MboII           175 kb (2)       1.0             >.90 
                                                0.8             <.05 
                                                0.6             <.05 
                                                0.3             <.05 

p95HS0.5       KpnI            25 kb            12.0            0.95 
                                                8.0             0.05 

(1) The map location of each polymorphic site 
refers to the position on the restriction map of the gene shown 
in Figure 1. 

(2) The location of the MboII site is approximate, 
since the precise position of this site within the map is not 
yet known. 

(3) The allele sizes were calculated from several 
independent measurements using HindIII fragments of lambda phage 
DNA as a standard. 

(4) Allele frequencies are based on a population of 
40-60 unrelated individuals. 

(5) p88PR0.6, p35R0.6, p2P0.3 and p95HS0.5 are 
probes isolated from other regions of the retinoblastoma gene, 
shown in Fig. 2. 

(The 11 bp sequence shown in Table 2 above the repeat unit 
represents the core sequence reported for some VNTRs observed 
by Nakamura et al.) Because such tandemly repeated sequences 
tend to be genetically unstable, the number of repeats is 
highly variable. Eight distinct alleles at this site have been 
detected, and more may exist. Because of this number of common 
alleles, seventy-five percent of unrelated individuals are 
heterozygous for this polymorphism. The high frequency of 
heterozygosity makes this polymorphism extremely useful. (In 
Table 2, the brackets "(" and ")" denote regions of variability 
within the repeat unit; and the bases underlined above and 
below the bracketed regions denote possible alternate bases for 
those regions of variability.) 
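
The relationship between the allele frequencies in Table 1 and the proportion of heterozygous individuals can be illustrated with the usual expected-heterozygosity formula, H = 1 - sum(p_i^2). The Python sketch below applies it to the eight RsaI allele frequencies listed for p68RS2.0. It assumes random mating (Hardy-Weinberg proportions), and because the tabulated frequencies sum to slightly more than 1, presumably through rounding, it rescales them first; that rescaling is an assumption of this sketch and not a step described in the text.

```python
def expected_heterozygosity(freqs):
    """Expected fraction of heterozygotes, 1 - sum(p_i^2), after rescaling
    the frequencies so that they sum to exactly 1."""
    total = sum(freqs)
    normalized = [f / total for f in freqs]
    return 1.0 - sum(p * p for p in normalized)

# RsaI allele frequencies for probe p68RS2.0 as listed in Table 1.
rsa1_freqs = [0.13, 0.02, 0.07, 0.07, 0.35, 0.20, 0.09, 0.09]
h = expected_heterozygosity(rsa1_freqs)
print(round(h, 2))
```

The result is close to 0.8, in the same range as the seventy-five percent figure given above.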



Fourteen retinoblastoma families carried 
constellations of alleles at this DNA polymorphism. This 
variation allows an examination of the frequency of 
co-inheritance of this site with the 
retinoblastoma-predisposing trait. For example, genomic DNA of 
individuals in these families was digested with RsaI and the 
co-inheritance of any one RsaI fragment with defective Rb 
alleles determined. 

TABLE 2 

SEQUENCE OF THE REPEAT UNIT WITHIN p68RS2.0 

GGGNNGTGGGG 

TG A lllllli II AG 

CGTGTTGAA { _A ) ACAC ( C ) TCCCCAATGCTGGAGGTGAGGTTTGGT { AGAAG 
G 

Sequence Analysis 

RFLP analysis can reveal only those sequence 
variations that give rise to a restriction fragment that is 
detectably different on a Southern blot from restriction 
fragments characterizing the normal nucleic acid. Most such 
detectable polymorphisms result from DNA sequence variation 
within a restriction endonuclease recognition site. These 
sites are rare, and finding them requires a laborious and 
expensive screening process in which genomic DNAs from several 
unrelated individuals are digested with as many as 50 different 
restriction enzymes. The fraction of all genomic DNA sequence 
polymorphisms (DSPs) at a specific locus that can be detected 
as RFLPs is small and depends on the number of enzymes used for 
screening; generally 90% or more of the DNA sequence 
polymorphism in the human genome is not within reach of 
RFLP-based analysis. Tens or even hundreds of potentially 
useful DSPs may exist within or near most disease-causing 
genes, but often only a few and sometimes none of these DSPs 
are detectable as RFLPs. Fig. 4 shows a scale map of the 200 
kilobase genomic region that includes the 27 exons of the human 
retinoblastoma gene. The 27 exons make up a 4.7 kilobase 
transcript. 

Fig. 7 shows the locations of the DNA sequence 
polymorphisms (DSPs) identified in this gene. Polymorphisms 
identified by the name of a restriction enzyme are RFLPs; 
polymorphisms RB1.3, RB1.20 and RB1.26 are not 
detectable as RFLPs, and were found by PCR-amplification and 
direct sequencing, as described below. The 200 kilobase 
genomic region was isolated in a series of overlapping inserts 
from 35 distinct recombinant bacteriophage lambda clones. 
Exon-containing segments were subcloned into Bluescribe plasmid 
cloning vectors (Stratagene, Inc.). Initial sequencing of 
cloned plasmid inserts was carried out using conventional 
methods for plasmid sequencing. Based on this genomic 
sequence, pairs of 20-base oligonucleotide primers were 
synthesized so that numerous regions 320 - 1200 bp in size 
could be amplified from genomic DNA by the polymerase chain 
reaction (PCR) method of Mullis et al. 

For each amplification reaction, from 200 ng to 1.0 µg 
of genomic DNA was prepared in a reaction buffer containing 20 
mM Tris (pH 8.4 or pH 8.6), 30 µg/ml bovine serum albumin, 300 
mM to 7.5 mM, 10-50 pM of each oligonucleotide primer, and 1 
unit of Taq polymerase (Perkin-Elmer Cetus). Optimal MgCl2 
concentrations and pH of the PCR reactions varied depending on 
the primer pair. PCR-amplification (30-35 rounds) was carried 
out following a cycle of 10 seconds at 94°C (denaturation), 10 
seconds at 42-55°C (annealing), and 30 seconds at 70°C 
(polymerization) using a programmable thermal cycler (Ericomp 
Corp., San Diego). All times are based on sample temperature 
rather than heat-block temperature, and do not include "ramping 
time" for the heat block. Optimal annealing temperatures 
varied for each primer pair. This protocol allowed 
PCR-amplification of regions as large as 2500 bp. Genomic DNA 
from 9-20 individuals was amplified for each region and 
screened for DSPs. 
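
For planning purposes, the cycling parameters above can be written down as data and the total hold time computed from them. The Python sketch below is only a bookkeeping aid: the class and function names are hypothetical, and, as the text notes for the stated times, the result excludes heat-block ramping time.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Step:
    name: str
    seconds: int

# One cycle as described in the text: hold times at sample temperature.
CYCLE = (
    Step("denaturation", 10),
    Step("annealing", 10),
    Step("polymerization", 30),
)

def hold_time_minutes(cycles: int, steps=CYCLE) -> float:
    """Total time spent at the hold temperatures, excluding ramping."""
    return cycles * sum(step.seconds for step in steps) / 60.0

for n in (30, 35):
    print(n, round(hold_time_minutes(n), 1))
```

For 30 to 35 rounds this gives roughly 25 to 29 minutes of hold time; the actual run is longer by however much ramping the particular thermal cycler requires.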

In order to detect and utilize a greater fraction of 
the existing DNA sequence polymorphism in the retinoblastoma 
gene, we applied techniques of polymerase chain reaction (PCR), 
generally as described by K.B. Mullis et al., 1987, Methods 
Enzymol., Vol. 155, pp. 335-51, and direct sequencing, 
generally as described by C. Wong et al., 1987, Nature, Vol. 
330, pp. 384-86, to analyse normal allelic variation at this 
locus. Oligonucleotide primers were synthesized to amplify 
regions from the gene that varied in size from 320 - 1200 bp. 
Amplification and sequencing were carried out on DNA from at 
least 9 unrelated individuals for all regions screened, though 
for many regions 15 or more individuals were analyzed. In most 
cases, primer pairs were derived from intronic sequences that 
flanked one of the 27 exons of the gene, such that the 
PCR-amplified region contained both intron and exon sequences. 

The results of this screening process are shown in 
Tables 3 and 4. Amplified DNA sequences were compared to one 
another and checked against sequence data from previously 
cloned plasmid inserts derived from the same region. Bases 
obscured by technical artifacts or other ambiguities were not 
tabulated. Of 3712 bp of genomic DNA sequence screened at this 
locus, four sequence variations were identified (Table 3; map 
locations are shown in Fig. 7). All four variations were found 
in introns; of these, one is likely a rare variant (found in 
only 1 of 15 individuals sequenced), and three represent bona 
fide DNA sequence polymorphisms. A representative example 
(RB1.3) is illustrated in Fig. 7. This polymorphism occurs 
near exon 3 of the retinoblastoma gene. Neither form of the 
polymorphic sequence forms the recognition site of a known 
restriction enzyme, and hence this DSP is not detectable as an 
RFLP. Among a total of 82 genetically distinct (from unrelated 
individuals) alleles examined, no other base was observed at 
this site. 

Fig. 8 illustrates the inheritance of the polymorphism 
RB1.3 in a retinoblastoma-prone family. Oligonucleotide 
primers (see Table 4) were used to PCR-amplify a 530 bp region 
of the human retinoblastoma gene that includes exon 3. The 
amplified fragment was sequenced by the methods described 
below. The sequence surrounding the polymorphism is written at 
the left side of the figure, read 5' to 3' from bottom to top, 
and the polymorphic bases are identified by adjacent tic marks. 

The details of the analysis were as follows. Prior to 
sequencing, all PCR-amplified DNA samples were treated with 
proteinase-K and extracted with phenol/chloroform. High 
molecular weight DNA was separated from unused dNTPs and 
oligonucleotide primers by column purification through 
sepharose CL-6B (Pharmacia). 250-400 ng of double-stranded 
PCR-amplified template was combined with 1-2 pM of (32P) 
end-labeled sequencing primer, and heat-denatured for 3 minutes 
at 96°C. This primer-template mixture was added to a buffer 
containing: MgCl2 (2.5 mM), Tris-HCl pH 7.5 (5 mM), 6 units 
Sequenase (U.S. Biochemical) and dithiothreitol (3 mM), and 
divided into 4 reaction mixtures each containing all four 
deoxynucleotides (32 µM each) and one dideoxynucleotide (5 
µM). This mixture was immediately incubated for 5 minutes at 
37-42°C, and polymerization was stopped with a 0.37% EDTA stop 
buffer. Prior to loading on sequencing gels, the samples were 
heat denatured at 96°C for 2 minutes. Conventional 0.4 mm 
thick, 6% polyacrylamide sequencing gels were used, and 
autoradiography was typically for 12-24 hours without an 
intensifying screen. 

Fig. 9 illustrates segregation of the DSP RB1.3 in 
three families with hereditary retinoblastoma. Alleles are 
shown beneath the symbol for each person. Affected individuals 
are indicated by filled symbols. In family RB-32, the (-) 
allele is the result of an intragenic deletion. By subsequent 
Southern blotting studies, the deletion was found to extend 
from exon 2 to exon 17. Based on these results, it can be 
predicted that the unaffected members of family RB-32 who carry 
the (G,-) genotype are also carriers of the mutation. 

The DSPs we have detected are valuable genetic markers 
for our studies of hereditary retinoblastoma. In its 
hereditary form, a predisposition to the disease is passed from 
affected individuals to their offspring as a dominant trait 
with 90% penetrance. It can be seen from Fig. 8 that the 
affected father, who has passed the disease to two children, is 
heterozygous for RB1.3. Both affected children received the G 
allele, while the unaffected child inherited the allele marked 
by an A at this polymorphic site. In this family, then, 
inheritance of the G allele from the affected parent is in 
phase with and diagnostic for the disease-predisposing 
phenotype. Fig. 9 shows our analysis of three other 
retinoblastoma-prone families using RB1.3. Inheritance of the 
polymorphic markers we describe here has followed the expected 
Mendelian pattern in every family examined so far. No 
cross-overs were observed between the polymorphic sites and the 
retinoblastoma-predisposing trait in any of the pedigrees. 
This follows our expectations since the polymorphisms are 
within the disease gene. In family RB-32, an intragenic 
deletion in one copy of the RB gene, presumably causing the 
predisposition to the tumor, was identified by Southern 
blotting (data not shown). The deletion includes the region 
surrounding RB1.3, and hence carriers of the 
disease-predisposing allele are genotypically hemizygous for 
the A allele (A,-). Two unaffected members of pedigree RB-32 
are carriers for the disease-predisposition, based on analysis 
of RB1.3 (see Fig. 9). More happily, Fig. 9 shows that the 
other unaffected children in pedigree RB-32, as well as those 
in pedigrees RB-36 and RB-50, are not carriers of the 
cancer-predisposition and therefore will not pass the disease 
on to their children. These results highlight the diagnostic 
value of this class of human genetic markers that were 
heretofore unavailable for this purpose. 
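
The phase reasoning applied to the family of Fig. 8 can be stated as a simple rule: the allele of the affected parent that is transmitted to every affected child, and to no unaffected child, is taken to be in phase with the predisposing mutation. The Python sketch below encodes that rule. It is a deliberately strict simplification: it assumes the allele each child received from the affected parent has already been determined, it ignores recombination, and it ignores the incomplete (90%) penetrance mentioned above, which means an unaffected carrier would make a family look uninformative under this rule. The function name and data layout are hypothetical.

```python
def allele_in_phase(parent_alleles, children):
    """Return the affected parent's allele that tracks with disease, or None.

    parent_alleles: the two alleles of the affected parent, e.g. ("A", "G").
    children: list of (allele inherited from that parent, affected?) pairs.
    """
    candidates = [
        allele
        for allele in parent_alleles
        if all((inherited == allele) == affected for inherited, affected in children)
    ]
    # A homozygous parent yields two identical candidates: not informative.
    return candidates[0] if len(candidates) == 1 else None

# Family of Fig. 8 as described in the text: a heterozygous affected father,
# two affected children with the G allele, one unaffected child with A.
fig8_children = [("G", True), ("G", True), ("A", False)]
print(allele_in_phase(("A", "G"), fig8_children))  # G
```

A parent who is homozygous at the marker gives no phase information, which is why markers with a high frequency of heterozygosity are the most useful for this kind of analysis.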

The data we present may also be used to estimate the 
level of heterozygosity in the human genome from a novel 
perspective. Previous estimates based on restriction enzyme 
screening may be subject to a bias because the sequences 
recognized by these enzymes do not necessarily reflect a random 
sampling. It is likely that a substantially higher level of 
polymorphism occurs at CpG pairs than elsewhere. This is 
reflected by the relatively high proportion of RFLPs revealed 
by such enzymes as MspI (CCGG) and TaqI (TCGA). The method 
we describe is not subject to this bias. From the results of 
our screening, it can be calculated that genomic heterozygosity 
at this locus is approximately h = 0.00039. If only the intron 
sequences are considered, this estimate increases to 
h = 0.00070. These estimates are below the predictions of 
others, and may reflect the absence from our methods of the 
bias described above. However, an analogous calculation of 
heterozygosity (0.00044 < h < 0.00087) based instead on our 
initial RFLP screening is also below the estimates of others 
and is quite consistent with our estimate based on direct 
sequencing. It seems likely that the human retinoblastoma gene 
is intrinsically less polymorphic than many other regions of 
the genome. Although mutations in this gene are known to be 
early events in the formation of several types of cancer, it is 
unclear why polymorphism at this locus may have been selected 
against in human evolution. 
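
The two sequencing-based estimates are linked by simple arithmetic. Because all four variants lie in introns, the summed heterozygosity of the variable sites is the same in both calculations and only the denominator changes: 3712 bp screened in total versus 2072 bp of intron sequence (Table 3). The Python sketch below back-calculates that shared numerator from the stated value of h and checks that it reproduces the intron-only figure. It assumes h is defined as the summed per-site heterozygosity divided by the number of base pairs screened, a definition the text does not spell out.

```python
TOTAL_BP = 3712   # base pairs screened in total (Table 3)
INTRON_BP = 2072  # intron base pairs screened (Table 3)

def per_bp_heterozygosity(summed_site_heterozygosity: float, bp_screened: int) -> float:
    """Heterozygosity per base pair: summed per-site values over bp screened."""
    return summed_site_heterozygosity / bp_screened

# Numerator implied by the stated whole-region estimate h = 0.00039.
implied_sum = 0.00039 * TOTAL_BP
h_intron = per_bp_heterozygosity(implied_sum, INTRON_BP)
print(round(implied_sum, 2), round(h_intron, 5))
```

Under that assumed definition, the implied numerator is about 1.45 and the intron-only value comes out at about 0.0007, consistent with the figure stated above.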

The approach for detecting DSPs demonstrated here has 
several advantages over conventional RFLP-based screening. As 
we have argued, DSP screening by amplification and direct 
sequencing could increase by an order of magnitude the number 
of available polymorphic markers at any cloned locus. This 
technique encompasses and supersedes restriction enzyme-based 
screening since RFLPs and VNTRs may also be detected. As the 
only requirement for utilization of such markers is knowledge 
of a unique set of amplification primer sequences and of the 
polymorphism itself, publication of a polymorphism immediately 
makes it available to all readers. Hence, problems and delays 
associated with the physical transfer of plasmid DNAs between 
laboratories are avoided, and the costs of maintaining plasmid 
repositories will be ultimately reduced. In addition, rapid 
analysis of these polymorphic markers can be carried out on a 
large scale with the use of allele-specific oligonucleotide 
probes for direct hybridization to amplified DNA. Finally, 
based on our experience with both strategies at the same locus, 
we found the expense and effort required to locate DSPs by 
either method to be comparable. 

Treatment of Patients Having a Defective Rb Gene 
In addition to screening, the invention includes 
polypeptide therapy for those individuals determined to contain 
a defective Rb allele, and who therefore are at risk of 
developing retinoblastoma. 

To prevent the formation of retinoblastoma in these 
individuals, the Rb polypeptide is administered therapeutically 
in an amount sufficient to inhibit retinoblastoma tumor 
formation or growth (anti-retinoblastoma-forming amount). An 
anti-retinoblastoma-forming dosage of the Rb polypeptide is 1 
to 500 µg/kilogram of body weight/day. The Rb protein can be 
administered by injection with a pharmacologically acceptable 
carrier, either alone or in combination with another agent. 
Acceptable pharmacological carriers are those which dissolve 
the Rb polypeptide or hold it in suspension, and which are not 
toxic to the extent of permanently harming the patient. 
Preferred are aqueous solutions of salts or non-ionic compounds 
such as sodium chloride or glucose, most preferably at an 
isotonic concentration. Other agents may be present provided 
that they do not interfere with the action of the Rb 
polypeptide. Those skilled in the art will know, or will be 
able to ascertain with no more than routine experimentation, 
particular pharmacological carriers for this composition. 

Rb polypeptide suitable for therapy can be prepared by 
any one of the following three conventional procedures. First, 
the Rb polypeptide can be produced by cloning the Rb cDNA from 
p4.7R into an appropriate mammalian expression vector, 
expressing the Rb gene product from this vector in an in vitro 
expression system, and isolating the Rb polypeptide from the 
medium or cells of the expression system. General expression 
vectors and systems are well known in the art. 

Second, the Rb polypeptide can be produced using 
protein chemistry techniques, wherein the specific amino acid 
residues are joined together synthetically in the appropriate 
sequence. 

Third, naturally occurring Rb protein can be isolated 
from total protein samples by affinity chromatography. 
Antibodies specific for the Rb protein are prepared by standard 
procedures (see below) and coupled to an inert matrix, which is 
then used to selectively bind the Rb proteins. 

Immunodiagnosis of Retinoblastoma 

This invention also includes methods for determining 
whether a particular tumor is the result of an Rb gene 
abnormality. Since osteosarcomas and certain undifferentiated 
tumors can result from detectable lesions in the Rb gene, 
immunodiagnosis can be used to aid in the diagnosis of such 
tumors. 

In order to produce anti-Rb antibody, a rabbit is 
immunized with either naturally occurring Rb protein or Rb 
polypeptide produced as described above. The anti-Rb antibody 
generated is then labeled, e.g., radioactively, fluorescently, 
or with an enzyme such as alkaline phosphatase. The labeled 
antibody is used to determine whether human tumors are of 
defective Rb gene origin. This can be carried out using any 
conventional technique. For example, the tumor sample can be 
liquefied and tested against the labeled antibody using a 
conventional ELISA (enzyme-linked immunosorbent assay) format. 
Alternatively, human tissue samples (e.g., biopsy samples) can 
be tested for expression of the retinoblastoma protein by other 
immunological techniques, see e.g., I. Roitt, Interaction of 
Antigen and Antibody, In Essential Immunology, Fifth edition, 
Boston: Blackwell Scientific Publications, 1984, pp. 145-75. 

Immune complexes will be detected in tumor samples 
which have antigens (e.g., retinoblastoma polypeptide) reactive 
with anti-Rb antibody. Tumors which lack these antigens 
presumptively have a defect (e.g., mutation or a deletion) in 
the retinoblastoma gene. 

Deposits 

Plasmids p2AR3.8 and p2AR0.9 were deposited on July 
17, 1987 with the American Type Culture Collection, Rockville, 
Maryland, and assigned ATCC accession numbers 40,241 and 
40,242, respectively. 

The Applicants represent that the ATCC is a depository 
affording permanence of the deposit and ready accessibility 
thereto by the public if a patent is granted. All restrictions 
on the availability to the public of the material so deposited 
will be irrevocably removed upon the granting of a patent. The 
material will be available during the pendency of the patent 
application to one determined by the Commissioner to be 
entitled thereto under 37 CFR 1.14 and 35 USC 122. The 
deposited material will be maintained with all the care 
necessary to keep it viable and uncontaminated for a period of 
at least five years after the most recent request for the 
furnishing of a sample of the deposited microorganism, and in 
any case, for a period of at least thirty (30) years after the 
date of deposit or for the enforceable life of the patent, 
whichever period is longer. Applicant acknowledges its duty to 
replace the deposit should the depository be unable to furnish 
a sample when requested due to the condition of the deposit. 

Other embodiments are within the following claims. 

Table 3 

DNA sequence polymorphisms detected by direct sequencing 

           Base Pairs Screened    Polymorphisms 
Introns    2072                   4 
Exons      1640                   0 

Totals     3712                   4 

DNA sequence polymorphisms found by direct sequencing of 13 
separate PCR-amplified regions from the human retinoblastoma 
locus. DNA samples from a minimum of 9 unrelated individuals 
were examined for all bases screened. Bases that could not be 
scored unambiguously were excluded from this tabulation. 

Claims 

1. Purified nucleic acid comprising a human 
retinoblastoma gene, or a fragment thereof comprising 15 or 
more bases, said nucleic acid being less than 100 kb in size. 

2. A vector comprising the nucleic acid of claim 1. 

3. A cell transformed with DNA encoding 
retinoblastoma polypeptide or a fragment thereof. 

4. The nucleic acid of claim 1, wherein said nucleic 
acid hybridizes specifically to said retinoblastoma gene under 
hybridizing conditions. 

5. An isolated polypeptide encoded by the nucleic 
acid of claim 1. 

6. An antibody produced to the polypeptide of claim 5. 

7. An antibody produced to naturally occurring 
retinoblastoma polypeptide. 

8. A method of detecting large deletions in the 
retinoblastoma gene of a human patient predisposing said 
patient to retinoblastoma, comprising the steps of: 

hybridizing a nucleic acid sample from said patient 
with a probe specific for the retinoblastoma gene, and 

determining the ability of said probe to hybridize to 
said nucleic acid, 

wherein lack of hybridization to said nucleic acid 
indicates the presence of a large deletion in said gene. 

9. A method of detecting large deletions in the 
retinoblastoma gene of a human patient that may predispose 
said patient to retinoblastoma, comprising the steps of: 

generating nucleic acid fragments from a sample of 
said patient, 

separating said fragments according to a determined 
physical property of said fragments, 

hybridizing a probe specific for the retinoblastoma 
gene to said fragments, 

detecting hybrids of said probe and said fragments, and 

comparing said hybrids to hybrids detected from the 
hybridization of said probe and separated nucleic acid 
fragments from a normal retinoblastoma gene, 

wherein the absence of hybrids, or the smaller size of 
said hybrids from the sample of said patient is an indication 
of large deletions in the retinoblastoma gene of said patient. 

10. The method of claim 8 or 9, wherein the probe 
specific for the retinoblastoma gene is the cloned DNA in 
p4.7R, or a fragment thereof. 

11. The method of claim 9, wherein the physical 
property is molecular weight. 

12. A method of detecting small deletions or point 
mutations in the retinoblastoma gene of a human patient 
predisposing said patient to retinoblastoma, comprising the 
steps of: 

determining the nucleotide sequence of a 
retinoblastoma allele, or subregion thereof, from said patient, 
and 



comparing said nucleotide sequence with the nucleotide 
sequence of a retinoblastoma allele or region thereof from a 
person not afflicted with retinoblastoma. 
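
By way of illustration only, and forming no part of the claims, the comparison step of claim 12 can be sketched in Python. The function name is hypothetical, and the sketch assumes the two sequences have already been aligned to equal length, with '-' marking a deleted base.

```python
def compare_alleles(patient, reference):
    """List the positions at which a patient sequence differs from a reference.

    Assumes both sequences are pre-aligned to the same length, with '-'
    standing for a deleted base. Positions are 1-based.
    Returns (position, reference_base, patient_base) tuples.
    """
    if len(patient) != len(reference):
        raise ValueError("sequences must be aligned to equal length")

    differences = []
    pairs = zip(reference.upper(), patient.upper())
    for position, (ref_base, pat_base) in enumerate(pairs, start=1):
        if ref_base != pat_base:
            differences.append((position, ref_base, pat_base))
    return differences
```

An empty result indicates no difference over the compared region; a tuple such as (4, 'T', 'A') indicates a point substitution, and a patient base of '-' indicates a small deletion.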

13. A method of detecting small deletions or point 
mutations in the retinoblastoma gene of a human patient 
predisposing said patient to retinoblastoma, comprising 
detecting mismatches between a nucleic acid sample from said 
patient and a detectable probe specific for the retinoblastoma 
gene from a person not afflicted with retinoblastoma, wherein 
mismatches are an indication of small deletions or mutations in
the retinoblastoma gene of said patient.

14. A method of diagnosing predisposition of a human 
patient to retinoblastoma, comprising detecting the 
co-inheritance of retinoblastoma alleles of said patient with
DNA polymorphisms in a pedigree analysis. 

15. A method of detecting genetic polymorphisms, in
the retinoblastoma gene of a human patient, that predispose
said patient to retinoblastoma, comprising the steps of:

generating nucleic acid fragments from a sample of 
said patient, 

separating said fragments according to a determined 
physical property of said fragments, 

hybridizing a detectable nucleic acid probe capable of 
hybridizing to the wild type retinoblastoma gene to said 
fragments, 

detecting hybrids of said probe and said fragments, and

comparing said hybrids to hybrids detected from the
hybridization of said probe and separated nucleic acid 
fragments from a sample of a parent of said patient, 

wherein the co-inheritance of specific genetic 
polymorphisms with the retinoblastoma gene is an indication of 
the predisposition of said patient to retinoblastoma.

16. The method of claim 15, wherein said physical 
property is molecular weight. 

17. A method of treating a human patient having a 
defective retinoblastoma gene comprising administering to said 
patient an anti-retinoblastoma-forming amount of the
retinoblastoma polypeptide.

18. A composition suitable for treating a human 
patient having a defective retinoblastoma gene, comprising 
retinoblastoma polypeptide and a pharmacologically acceptable 
carrier therefor. 

19. A method of detecting the presence, in a tumor
sample, of a protein the absence of which is associated with a 
neoplasm, said method comprising producing an antibody to said 
protein, contacting said antibody with said tumor sample, and 
detecting immune complexes as an indication of the presence in 
said tumor sample of said protein. 

20. A method of detecting the presence of the 
retinoblastoma protein in a tumor sample from a human patient, 
comprising the steps of:

contacting said tumor sample with an antibody which
specifically reacts with the retinoblastoma protein, and 

determining whether immune complexes are formed with 
said antibody, the formation of said immune complexes being an 
indication that the tumor is not retinoblastoma and the absence 
of immune complexes indicating that the tumor is retinoblastoma.

21. The method of claim 20, wherein said antibody is 
a monoclonal antibody. 



Abstract of the Disclosure 
This invention relates to nucleic acid, or 
fragments thereof, encoding the retinoblastoma 
polypeptide, the retinoblastoma polypeptide itself, 
methods of detecting a defective retinoblastoma gene in 
human patients, and methods of treating these patients.

Fig. 5-1



GTCATGCCGCCCAAAACCCCCCGAAAAACGGCCGCCACCGCCGCCGCTGCCGCCGCGGAACCCCCGGCACCGGCCGCCGC 80
CCCCTCCTGAGGAGGACCCAGAGCAGGACAGCGGCCCGGAGACCTGCCTCTCGTCAGGCTTGAGTTTGAAGAAACAGAAG 160
AACCTGATTTTACTGCATTATGTCAGAAATTAAAGATACCAGATCATGTCAGAGAGAGAGCTTGGTTAACTTGGGAGAAA 240
GTTTCATCTGTGGATGGAGTATTGGGAGGTTATATTCAAAAGAAAAAGGAACTGTGGGGAATCTGTATCTTTATTGCAGC 320

AGTTGACCTAGATGAG 336

337 ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC 378
MSFTFTELQKNIEI

379 AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC 420 
SVHKFFNLLKEIDT 

421 AGT ACC AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG 462
STKVDNAMSRLLKK 

463 TAT GAT GTA TTG TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA 504
YDVLFALFSKLERT

505 TGT GAA CTT ATA TAT TTG ACA CAA CCC AGC AGT TCG ATA TCT 546
CELIYLTQPSSSIS 

547 ACT GAA ATA AAT TCT GCA TTG GTG CTA AAA GTT TCT TGG ATC 588 
TEINSALVLKVSWI 

589 ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA CAA ATG GAA GAT 630 
TFLLAKGEVLQMED

631 GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC CTT GAC 672
DLVISFQLMLCVLD

673 TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA 714 
YFIKLSPPMLLKEP 

715 TAT AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA 756
YKTAVIPINGSPRT

757 CCC AGG CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA 798 
PRRGQNRSARIAKQ 

799 CTA GAA AAT GAT ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA 840 
LENDTRIIEVLCKE

841 CAT GAA TGT AAT ATA GAT GAG GTG AAA AAT GTT TAT TTC AAA 882
HECNIDEVKNVYFK

883 AAT TTT ATA CCT TTT ATG AAT TCT CTT GGA CTT GTA ACA TCT 924
NFIPFMNSLGLVTS

925 AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT AAA CGA TAC GAA 966 
NGLPEVENLSKRYE 

967 GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA TTA TTT 1008 
EIYLKNKDLDARLF

1009 TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT 1050
LDHDKTLQTDSIDS

1051 TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA 1092
FETQRTPRKSNLDE

1093 GAG GTG AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT 1134 
EVNVIPPHTPVRTV 

1135 ATG AAC ACT ATC CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA 1176 
MNTIQQLMMILNSA

1177 AGT GAT CAA CCT TCA GAA AAT CTG ATT TCC TAT TTT AAC AAC 1218
SDQPSENLISYFNN

1219 TGC ACA GTG AAT CCA AAA GAA AGT ATA CTG AAA AGA GTG AAG 1260
CTVNPKESILKRVK 

1261 GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT GCT AAA GCT GTG 1302
DIGYIFKEKFAKAV

1303 GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC AAA CTT 1344
GQGCVEIGSQRYKL

1345 GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA 1386 
GVRLYYRVMESMLK 

1387 TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT 1428 
SEEERLSIQNFSKL 

1429 CTG AAT GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT 1470 
LNDNIFHMSLLACA

1471 CTT GAG GTT GTA ATG GCC ACA TAT AGC AGA AGT ACA TCT CAG 1512
LEVVMATYSRSTSQ

1513 AAT CTT GAT TCT GGA ACA GAT TTG TCT TTC CCA TGG ATT CTG 1554
NLDSGTDLSFPWIL

1555 AAT GTG CTT AAT TTA AAA GCC TTT GAT TTT TAC AAA GTG ATC 1596
NVLNLKAFDFYKVI

1597 GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG ACA AGA GAA ATG 1638 
ESFIKAEGNLTREM 

1639 ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG GAA TCC 1680 
IKHLERCEHRIMES

1681 CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA 1722
LAWLSDSPLFDLIK

1723 CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT 1764
QSKDREGPTDHLES

1765 GCT TGT CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA 1806 
ACPLNLPLQNNHTA 

1807 GCA GAT ATG TAT CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA 1848 
ADMYLSPVRSPKKK

1849 GGT TCA ACT ACG CGT GTA AAT TCT ACT GCA AAT GCA GAG ACA 1890 
GSTTRVNSTANAET 

1891 CAA GCA ACC TCA GCC TTC CAG ACC CAG AAG CCA TTG AAA TCT 1932 
QATSAFQTQKPLKS

1933 ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG TAT CGG CTA GCC 1974 
TSLSLFYKKVYRLA 

1975 TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG TCT GAG 2016 
YLRLNTLCERLLSE 

2017 CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC 2058 
HPELEHIIWTLFQH 

2059 ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG 2100 
TLQNEYELMRDRHL 

2101 GAC CAA ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG 2142 
DQIMMCSMYGICKV

2143 AAG AAT ATA GAC CTT AAA TTC AAA ATC ATT GTA ACA GCA TAC 2184
KNIDLKFKIIVTAY

2185 AAG GAT CTT CCT CAT GCT GTT CAG GAG ACA TTC AAA CGT GTT 2226
KDLPHAVQETFKRV

2227 TTG ATC AAA GAA GAG GAG TAT GAT TCT ATT ATA GTA TTC TAT 2268
LIKEEEYDSIIVFY 

2269 AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA AAT ATT TTG CAG 2310 
NSVFMQRLKTNILQ 

2311 TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA CCT CAC 2352 
YASTRPPTLSPIPH 

2353 ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG 2394 
IPRSPYKFPSSPLR

2395 ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA 2436
IPGGNIYISPLKSP

2437 TAT AAA ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT 2478
YKISEGLPTPTKMT

2479 CCA AGA TCA AGA ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG 2520
PRSRILVSIGESFG

2521 ACT TCT GAG AAG TTC CAG AAA ATA AAT CAG ATG GTA TGT AAC 2562 
TSEKFQKINQMVCN 

2563 AGC GAC CGT GTG CTC AAA AGA AGT GCT GAA GGA AGC AAC CCT 2604
SDRVLKRSAEGSNP

2605 CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT ATT GAA GGA TCA 2646
PKPLKKLRFDIEGS

2647 GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG TCC AAA 2688
DEADGSKHLPGESK

2689 TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA 2730 
FQQKLAEMTSTRTR 

2731 ATG CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC 2772 
MQKQKMNDSMDTSN

2773 AAG GAA GAG AAA 2784
KEEK

TGAGGATCTCAGGACCTTGGTGGACACTGTGTACACCTCTGGATTCATTGTCTCTaCAGATGTGACTGTATAACTTTCC 2864 

CAGGTTCTGTTTATGGCCACATTTAATATCTTCAGCTCTTTTTGTGGATATAAAATGTGCAGATGCAATTGTTTGGGTGA 2944

TTCCTAAGCCACTTGAAATGTTAGTCATTGTTATTTATACAAGATTGAAAATCTTGTGTAAATCCTGCCATTTAAAAAGT 3024 

TGTAGCAGATTGTTTCCTCTTCCAAAGTAAAATTGCTGTGCTTTATGGATAGTAAGAATGGCCCTAGAGTGGGAGTCCTG 3104

ATAACCCAGGCCTGTCTGACTACTTTGCCTTCTTTTGTAGCATATAGGTGATGTTTGCTCTTGTTTTTATTAATTTATAT 3184 

GTATATTTTTTTAATTTAACATGAACACCCTTAGAAAATGTGTCCTATCTATCTTCCAAATGCAATTTGATTGACTGCCC 3264

ATTCACCAAAATTATCCTGAACTCTTCTGCAAAAATGGATATTATTAGAAATTAGAAAAAAATTACTAATTTTACACATT 3344

AGATTTTATTTTACTATTGGAATCTGATATACTGTGTGCTTGTTTTATAAAATTTTGCTTTTAATTAAATAAAAGCTGGA 3424 

AGCAAAGTATAACCATATGATACTATCATACTACTGAAACAGATTTCATACCTCAGAATGTAAAAGAACTTACTGATTAT 3504 

TTTCTTCATCCAACTTATGTTTTTAAATGAGGATTATTGATAGTACTCTTGGTTTTTATACCATTCAGATCACTGAATTT 3584

ATAAAGTACCCATCTAGTACTTGAAAAAGTAAAGTGTTCTGCCAGATCTTAGGTATAGAGGACCCTAACACAGTATATCC 3664 

CAAGTGCACTTTCTAATGTTTCTGGGTCCTGAAGAATTAAGATACAAATTAATTTTACTCCATAAACAGACTGTTAATTA 3744

TAGGAGCCTTAATTTTTTTTTCATAGAGATTTGTCTAATTGCATCTCAAAATTATTCTGCCCTCCTTAATTTGGGAAGGT 3824

TTGTGTTTTCTCTGGAATGGTACATGTCTTCCATGTATCTTTTGAACTGGCAATTGTCTATTTATCTTTTATTTTTTTAA 3904 

GTCAGTATGGTCTAACACTGGCATGTTCAAAGCCACATTATTTCTAGTCCAAAATTACAAGTAATCAAGGGTCATTATGG 3984 

GTTAGGCATTAATGTTTCTATCTGATTTTGTGCAAAAGCTTCAAATTAAAACAGCTGCATTAGAAAAAGAGGCGCTTCTC 4064

CCCTCCCCTACACCTAAAGGTGTATTTAAACTATCTTGTGTGATTAACTTATTTAGAGATGCTGTAACTTAAAATAGGGG 4144

ATATTTAAGGTAGCTTCAGCTAGCTTTTAGGAAAATCACTTTGTCTAACTCAGAATTATTHTAAAAAGAAATCTGGTCT 4224

TGTTAGAAAACAAAATTTTATTTTGTGCTCATTTAAGTTTCAAACTTACTATTTTGACAGTTATTTTGATAACAATGACA 4304

CTAGAAAACTTGACTCCATTTCATCATTGTTTCTGCATGAATATCATACAAATCAGTTAGTTTTTAGGTCAAGGGCTTAC 4384

TATTTCTGGGTCTTTTGCTACTAAGTTCACATTAGAATTAGTGCCAGAATTTTAGGAACTTCAGAGATCGTGTATTGAGA 4464 

TTTCTTAAATAATGCTTCAGATATTATTGCTTTATTGCTTTTTTGTATTGGTTAAAACTGTACATTTAAAATTGCTATGT 4544

TACTATTTTCTACAATTAATAGTTTGTCTATTTTAAAATAAATTAGTTGTTAG 4597 
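
The one-letter amino acid line printed beneath each codon line of Fig. 5 can be checked against the nucleotide sequence with a short script. The following Python sketch is offered as an illustration only; it assumes the standard genetic code, and the names CODON_TABLE and translate are hypothetical helpers rather than anything defined in this disclosure.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string, read from its first base, into one-letter codes.

    Whitespace is ignored, stop codons are written as '*', and any trailing
    partial codon is dropped. Raises ValueError on an unrecognized codon,
    such as one containing an OCR artifact.
    """
    sequence = "".join(dna.split()).upper()
    protein = []
    for start in range(0, len(sequence) - len(sequence) % 3, 3):
        codon = sequence[start:start + 3]
        if codon not in CODON_TABLE:
            raise ValueError(f"unrecognized codon {codon!r} at offset {start}")
        protein.append(CODON_TABLE[codon])
    return "".join(protein)


# Codons 337-378 of Fig. 5, as printed above.
first_line = "ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC"
print(translate(first_line))
```

Raising an error on an unrecognized codon, rather than substituting a placeholder, makes transcription errors in a codon line surface immediately.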
SacII
EXON 1 



SacII 



EXON 2 



Sad 



EXON 3 



EcoRI 



^aggagggag 
agtcttccct 
GTATAGCCCC 
GGGCCGCGCC 
TCCTCATGAG 
CGGCGNGNNC 
ACGCGGCGCT 

AATTATTTTT 
TCCTCCCCGG 

CAAAACCCCC 

oLysThrPro 

GCCGCCGCCG 

oProProPro 

TCTCGTCAGG 

oLeuValAr 

TAGGGCGGGC 

GGAGGAGGCG 

TCTGGGTTCT 

agnntcctga 

gttgcgagcg 

cccaaacagc 
gatatttgac 
gaccatgaaa 
taagatcttA 
AAACAAGTAT 
ATTTGGTAGG 
8 

ATTAAAGATA 
sLeuLysIle 
TGTGGATGGA 
rValAspGly 
TAAAACCAAC 
AAGctaataa 
gaggccgggc 
attgcctgag 

ctatttgaga 
atactttttt 
tttcaaatat 
TTGTATGCTG 
GAATTATTTA 

AAAAAGGAAC 
LysLysGluL 
TTCACTTTTA 
PheThrPheT 
AAGCCTCTGC 
TAGTGAGAAA 
ATTCTCTCAT 
TTTGTGTCCT 



agtggngtcc 
atcagacccc 
GTTAAGTGCA 
GGATGCCTCC 
TTAGNGTCCC 
GGGAGCCTGC 
CAGTTGCCGG 

GTAACGGGAG 
CGCTCCTCCA 

CGAAAAACGG 
ArgLysThrA 
CCCCCTCCTG 
ProProProG 
TGAGCGAGCA 

GCAAGGCGgC 
CCCTCCCTGG 
TGGGCGAGGG 
gcgaagtgac 
tncatcagac 

tttagctatt 
ttaccatgca 
aagataatca 
AAGTATTTAA 
GTACTGAATC 
CTTGAGTTTG 
• LeuGluPhftG 
CCAGATCATG 
ProAspHlsV 
GTATTGGTAA 
ValLeu 
TTCAAATCAC 
taattccatt 
acgtggttca 
ctcaggagtt 

tgactgaccc 
gccttataat 
atgccatcag 
AATAAGAAAA 
ATGAAATATT 

TGTGGGGAAT 
euTrpGlyll 
CTGAGCTACA 
hrGluLeuGl 
CATAAAAGGA 
TGGTAAAATA 
GGAGCCGTTA 
GGAAAAATTT 



ngnngagggt 
gggataggga 
CCCCGGCGTG 
TGGAAGGCGC 
AGCCNGCGCA 
GGACGTGAGC 
GCGGGGGAGG 

TCGGGAGAGG 
CAGCTCGCTG 

CCGCCACCGC 
laAlaThrAl 
AGGAGGACCC 
luGluAspPr 
GAGCGCGTCN 

TCGGCGGGGA 
CCCCCGCCAC 
GTCTCGGCTT 
aggtgcagtt 
aaaaaaatga 

acatttactt 
agcaaatatt 
tatgnnnaaa 
TAATGTTCTT 
AATTTGATTT 
AAGAAACAGA 
luGluThrGl 
TCAGAGAGAG 
alArgGluAr 
GGATTTTCTT 

TATACAAAAA 
acccagagga 
tgcctgtaat 
cgagaccagc 

ctaaagttcc 
ataaaatttg 
aaggatgtgt 
AATCAGTTAT 
TGATCTTTAT 

CTGTATCTTT 
eCysIlePhe 
GAAAAACATA 
nLysAsnlla 
AAC GAATTCT 
AAGTAAAAGA 
TGAAAGTGTA 
TGATTGTGTC 



gcactagcca gatattctgc ggggcccgag 
tgaggCCCAC AGTCACCCAC CAGACTCTTT 
GAGGGGGTGG TTCTGGGTAG AAGCACGTCC 
CTGGACCCAC GCCAGGTTTC CCAGTTTAAT 
COOACCAGCG CCCCAGTTCC CCACAGACGC 
GCGGGCGGAA GTGACG TTTT CCGCGGTTGG 
GCGCGTCCGG TTTTTGTCAG GGGACGTTGA 

"start of cDNA sequence 
ACGGGGCGTG CCCCGCGTGC GCGCGCGTCG 
GCTCCCGCCG CGGAAAGGCG TCATGCCGCC 

MetProPr 

CGCCGCTGCC GCCGCGGAAC CCCCGGCACC 
aAlaAlaAla AlaAlaGluP roProAlaPr 
AGAGCAGGAG AGCGGCCCGG AGGACCTGCC 
oGluOlnAap SerGlyProG luAspLeuPr 
CTCACGCGGG AAGGGCGCCC CGGGTGTGCG 



CCCGTCCTCG 
GGcggaGCGT 
CAACTTGACA 
ccctcttgtg 
aaaataaaaa 

tccttcacag 
tttcactgtg 
tttgaagtgt 
TTTCACAGTA 
ATAAGATATG 
AGAACCTGAT 
uGltiProAsp 
AGCTTGGTTA 
gAleTrpLeu 
AAAACGTTTT 

TTGAAAGATA 
aatttacctc 
cctaccactt 
ctgggcaaca 

acaataacta 
aatgtttgtt 
tacaa&tata 
AATACAGTTT 
TTTTTGTTCC 

ATTGCAGCAG 

IleAlaAUV 

GAAATCAGGT 

Gl uIleS e 

GGATTTTCCT 

AAAAGAACTT 

TTTATGCTGT 

TCTCACTATT 

4*6 kb-- 



CCAGGGgCCG GGTCCcgGNG 
CTGCAGAATG GTGACAGGAT 
GGTGTCGGGC GGGTggggct 
agttctcggan ncagaggntc 
tacaaaaa 



aagtgttttg 
tggtatcctt 
aatgtttttc 
GTGTTATGTG 
CCAATTATAT 
TTTACTGCAT 
PheActAlaL 
ACTTGGGAGA 
ThrXrpGluL 
GAAATTTTTT 



ctgctttgaa 
attttggaat 
taagataaaa 
CAAACTATTP 
GATTATTTTC 
TATGTCAGAA 
euCysGlnLy 
AAGTTTCATC 
ysValScrSe 
TTTCTCATT't 



GAAAAATATA AAGACAATAA 
tgctaacatt aaaaatgttt 
tgggaggctg aggcaggtgfe 
tggt 



tttaattttt 
attagtgtga 
cagtATTACA 
TAACATAGTA 
AGGGAGGTTA 
GlyGlyTy 
TTGACCTAGA 
alAspLeuAs 
AAAGTTTCTT 



tatctttcta 
aatgaaatcc 
AACATTTATT 
TCCAGTGTGT 
TATTCAAAAG 
rlleGlnLys 
TGAGATGTCG 
pGltiMetSer 
GTATAAATAT 



CTCAATAGAC TTTTGTGAAT 
GGACCAAATA GTGAACTGOC 
ATTTCTTTAA GAGGTAGCA^ 
CATGTGTAAG C 



060 
120 
180 
240 
300 
360 
A20 

A80 

SAO 

(3) 

600 
(23) 

660 
(A3) 

720 
(A6) 

780 

SAO 

900 

960 
1008 

060 
120 
180 
2A0 
300 
360 

(63) 
A20 

(83) 
A80 

(88) 
SAO 
600 
660 
70A 

060 
120 
180 
2A0 
300 

(9A) 
360 

(UA) 
A20 

(127) 
A80 
SAO 
600 
651 



Fig. 6-2 



gcataggtat atagataata gaggtgtaag ttgaaggcta attatttttg caaaaagtaa 060 
ttccttccaa aggatatagt agtgatttga tgtagagctg ataatcTTTT GAATTG/JkAT 120 
ATCTATGATT TGAAAACGAA ATAACACAAA TTTTTAAGGT TAGTGATTTA CTTTTTTGTA 180 
EXON 4 TTCTTTCCTT TGTAGTGTCC ATAAATTCTT TAACTTACTA AAAGAAATTG ATACCAGTAC 240

rValH isLysPhePh eAsnLeuLeu LysGluIleA spThrSerTh (142)
CAAAGTTGAT AATGGTATGT CAAGACTGTT GAAGAAGTAT GATGTATTGT TTGCAGTGTT 300 
rLysValAsp AsnAlaMatS erArgLetiLa uLysLysTyr AspValLeuP heAlaLeuPh (162) 
CAGCAAATTG GAAAGGTAAA GTAAACATTT TATTAGGGTT ACACTCTGAT TTTTTATGTC 360 
eSerLysLeu GluAr (167) 
ATTGTTCACA ATTAGATTCT GGGAATTATT TAACACATTT AGTAAAGTTA GTAAGTATTA A20 
ATTCTTAgac ttgtcccttt taatgttagc tcattaattc ttagctttct tatttatcca A80 
gtaa^tatgca ttctgaatgc ttcctggaaa attaaecgtt attatccttt catgtctcca SAO 
tttgttttca aaacttagct tatcgagtat 570 

—2.1 kb-- 

gagatattta aagagnaact ttactaacct taggtggatc agctgggtgt tttctatctt 060 
atttatacct tttttttgaa GACTAATTGA GAGGATTAAC TGTAATTATA TATTAAAGTG 120 
ATGTGAGATG TCATAAATTG GGAAAATCTA CTTGAACTTT GTTTTATAAT GCTATATATT 180 
TTTTGTTTTT AAAATATATA CTTCTTAAAA GAAGATGMT AAAGCATGAG AAAACTACTA 2A0 
EXON 5 TGACTTCTAA ATTACGAAAA AATGTTAAAA AGTCATAATG ITmcmT CAGGACATGT 300 

gThrCys (169) 

GAACTTATAT ATTTGACACA ACCCAGCAGT TCGTAAGTAG TTGACAGAAT GTTATTTTTC 360 
GluLeuIleT yrLeuThrGl nPr.oSerSer Se •* (180) 

ACTTAAAAAA AAAGATTTTT • ATGGAATAAT CTCAAACATC TTGATAGTTA GGGTTAGTTT A20 
GATCGATTAT AGCAGGCTAC Ttcataaatt aagcccatag atttaagtcc tgtgtagatt A80 
atttatcttc tcacaaagaa aatagtataa aatacatgcc .ttgtactaca aagaagaact SAO 
aataaggtgg aattgattca ggacagcata tcaccaactc tgagaaaaat gcaacaaatg 600 
caaattcatt gactaa * * *^ 616 

—1,4 kb- 

aaatggactg cattctatta tgcatttaac taaggtcatt ttttttttaa tGCACAAAAA 060 
GAAACACCCA AAAGATATAT CTGGAAAACT TTCTTTCAGT GATACATTTT TCCTGTTTTT 120 
EXON 6 TTTCTGCTTT CTATTTGTTT AATAGGATAT CTACTGAAAT AAATTCTGCA TTGGTGCTAA 180 

rlleS erThrGlull eAsnSerAla LeuValLcuL (192) 
AAGTTTCTTG GATCACATTT TTATTAGCTA AAGGTAAGTT CATTATATTT ATTAAATGCT 2A0 
ysValSerTr pIlsThrPha LfttjLeuAlaL ysG * r--- (203) 
AATATTTCAA ATGTAATAAT TAAATTGGCA mCTTTGGA CTAAATTCCC CAATTTTTAT 300 
TGAGTAATGT ACTCCTccct cattctctgc ttggcttatt aactgttagc aagttcctat 360 
aattctggta ctagaaacaa ccttggaaat gctttattta atntttgttt ctaatattcc A20 
atcttccctc cctt A3A 

—11.5 kb— 

tttatagtga ttttagacat aaagaattaa ttataacaga aatagcttaa atgtaaaatt 060 
ctcagagtag agcttaacac ttgatttata attccataac tttacatatt tCTATTTTAC 120 
ATATTTTATA CCTTTTAAAA CAGATTTTTT TTTTTTTTAC AAAAAAAAGA AAGAAAATCT 180 
TTACCATGCT GATAGTGATT GTTGAATGAA TAAATTTATG GATATACTCT ACCCTGCGAT 2 AO 
TTTCTCTCAT ACAAAGATGT GAATGTCTAA CTTTCTTTAA AAATGTACAT TTTTTTTTCA 300 
EXON 7 GGGGAAGTAT TACAAATGGA AGATGATGTG GTGATTTCAT TTCAGTrAAT GCTATGTGTG 360 
lyGluValL euGlnHetGl uAspAspLeu VallleSerP haGlnLeuMa tLeuCysVal (222) 
CTTGACTATT TTATTAAAGT CTCACCTCCC ATGTTGGTCA AAGAACGATA TAGTAAGTAT A20 
LeuAspTyrP helleLysLe uSerProPr.o MatLariLeuL ysGluProTy rL (2A0) 
TTAATTTATG CCCCTTTTAC TTTCTGATTC AGGAGTTGCT TATTGAATGT CTAGTGGGTA A80 
CCAAACATGG TTCTAAGGCT GACAGGATGA TAAAAAATAA ATCAgacatg gactttgcc.c SAO 
ataagtagtg taagttatag aaggaaagat aagacatgga aacaaatgat tagagtatat 600 
ggtagaaagt ggtttcgggt caaaatacaa caaatggagg tttgggagac aagaag : 656 

— 1.8 kb~ 



Fig. 6-3 



EXON 8 



EXON 9 
EcoRI 



HindIII



EXON 10 
XbaI



gctattccat 
gannagagta 
ATGAAATTGT 
TTATTTTTGA 
TGTTCTTATC 

fCGAACACCC 
oArgThrPro 
TACAAGAATT 
pThrArglle 
ACTTCATGAT 
TTTACTTCTT 
CAAAATATTT 
cctcactct 

aagcattgaa 
ggcaatatag 
ATTTTTTACT 
AGATTAGATT 

AAATGTTTAT 
sAsnValTyr 
TGGACTTCCA 
nGlyLeuPro 
GAGGGAGGAT 
GCCCAAGAAG 
aagctt 

gtagcattgg 
GACCTAAAAT 
TTATTTATAT 
ATTCTTTAAT 
TTGTCAGTGA 

TCTTAAAAAT 
tLeuLysAsn 
TGATTCTATA 
rAspSerlle 
GATACTGATA 
ACAAATTACT 
cttatctaca 
gttaactgaa 



gccttctctt 
gaagagggat 
TATCCTTCTA 
CCTAAGTTAT 
TAATTTACCA 

AGGGGAGGTC 
ArgArgGlyG 
ATTGAAGTTC 
IleGluValL 
TTCTTTAAAA 
TTGTTATGAG 
AGAAAAAtta 



tgtatttgtt 
gCAAAAACTA 
ATGAAACCTA 
AGTTAGAATA 
CTTTTACAGA 

y 

AGAACAGGAG 
laAsnArgSe 
TCTGTAAAGA^ 
euCysLysGl' 
CAGTTAAAGT 
CATGTTTTTT 
attcgttata 



tatgagactg 
ATATTAGTAC 
ATAAGTAAAA 
CTTCATTATT 
AACAGGTGTT 
sThrAlaVal 
TGCACGGATA 
rAlaArglle 
ACATGAATGT 
uHisGluCys 
AGATTXAGAT 
TT6TAATTAG 
tttagttact 



tagtttacag 
ATAATTTGTA 
GTAGTAGAAT 
TTATATGATG 
ATACCCATTA 
IleProlleA 
GCAAAACAAC 
AlaLysGlnL 
AATATAGATG 
AsnlleAspG 
GTAAGTTCTC 
TGCTAACTCT 
ttgatttaag 



gctgtaatgc atgtgattgc acctgtgaat agccactaca 
agagacccct tctcTAAGAA AATAATAAAA AATAAAAAAG 
GCATGGGGGA TTGACACCTC TAACTTACCC TGCATTGTTC 
TTGTTTTAAA TTTTAATGAT CATGTTGTAA CTTCATCTTT 



TTCAAAAATT 
PheLysAsnP 
GAGGTAATCT 
Glu 

AATTGTCTAA 
AATCTAGCCA 



TTATACCTTT TA TGAATTCT 
helleProPh eMetAsnSer 
GAAAGGAAAT TTAATAAAAT 

CTTTCTTAGA TCAATTTACT 
AGTAGAATTG TGGTGAAACT 



ctatctttgt 
CAAAGTTGAA 
TGCATGCGAA 
GAAATCTGTG 
CTTTTTTCTT 

AAAGATCTAG 
LysAspLeuA 
GACAGGTATT 
AspSe 

TAGGTAGATT 
TTCAAATGTC 
aatatJagatt 
tataattttt 



ctacataaaa 
CAAATGTTGC 
CTCAGTGTAT 
CCTCTGTGTG 
TCAAGGTTGA 
ValGl 
ATGCAAGATT 
spAlaArgLe 
GCACATGGTA 



1.9 kb— 

ttctaataaa 

AATTTTCTGT 

ATTACAAAAT 

CTGAGAGATG 

AAATCTTTCT 

uAsnLeuSer 

ATTTTTGGAT 

uPheLetiAsp 

TATTTGATTG 



CTTGGACTTG 
LeuGlyLeuV 
ATTAATGTTT 

GTGTATCACA 
AACTTTTGTA 



tattttctat 
ACCTCACTTT 
TAAATGTATA 
TAATGACATG 
AAACGATACG 
LysArgTyrG 
CATGATAAAA 
HlsAspLysT 
ATTTGCTTTA 



ttctttttgg 
GTAGATATGG 
GTTACCAAGA 
GATGTACAAT 
ATGGTTCACC 
snGlySerPr 
TAGAAAATGA 
euGlxiAsnAs 
AGGTAATTTA 
lu 

CCTAACAATA 
TTTGCAGTAG 
agagtagctc 



cttcagccta 
TTATACACAG 
AAGAGTCAXG 
TTCAGGTGAA 
ValLy 
TAACATCTAA 
alThrSerAs 
TGAGACTGTG 



ATATAGTCTT TAGCTTAGTG ACCTTTAGAT 
TTTATACAAA GAAAAGTTTA ACAGTATTTT 
taatgtgaat tgtgtgtcct ataacagtta 
aaaatgtgca ccaaaagata atggcta 
— 1.0 kb— 



TAGTAacaaa 



gcacgAAATA 
TAGATAGACC 
TTATACAAAA 
TAAAGGATAA 
AAGAAATTTA 
luGluIleTy 
CTCTTCAGAG 
hrLeuGlnTh 
GATATAGGTT 

ATCATTTATA 
AAGcatataa 
cctttttnca 



060 

120 

180 

2A0 

300 
(250) 

360 
(270) 

A20 
(287) 

A80 

5A0 

600 

609 

060 
120 
180 
2A0 

(289) 
300 

(309) 
360 

(313) 
A20 
A80 
A86 

060 
120 
180 
2A0 
300 
(325) 

360 
(3A5) 
A20 
(350) 
A80 
5A0 
600 
6A7 



Fig. 6-4



t 



EXON 11 



EXON 12 



EXON 13 



EcoRI 



060 
120 
ISO 
2A0 
300 
360 

(367) 
A20 

(326) 
A80 
5A0 
600 
660 
680 



aatactgaac aacttggtta tcaataccnc cagggagaag catctgactt tcacttttaa 
aaaaagactt aatgattggt atacctcttt gtcataaaca taatggaaag agacccacaa 
ttaaaaagng tagtgAAAGG TATTTTATTT AAGCAGCAGC TGG6TGATCT ATTTTCTATC 
CTATCTATTA TTGAGTTATC ATTTTATATG ATTTTATGAG ACAACAGAAG CATTATACTG 
CTTTTTTGAT GCATAAAGCA CAAATTGTAA ATTTTCAGTA TGTGAATGAC TTCACTTATT 
GTTATTTAGT TTTGAAACAC AGAGAACACC ACGAAAAAGT AACCTTGATG AAGAGGTGAA 
r PheGluThrG InArgThrPr oArgLysSer AsnLetiAspG luGluVaUs 
TGTAATTCCT CCACACACTC CAGTTAGGTA TGAATTTTCC TACTTTTAAT TATATTATAA 
nValllePro ProHisThrP roValAr 

TTTTGTTATT CATGGCTTTA TAGTGTTTCA GATTTGTTCA CGTTTCTTTA TGTATTCATA 
CATACATGTA AGAAATATAT ATTGAAGGCC AGGTGTGGTG GATCACACCT GTAATCCCAG 
CACTTTGGGA GGCCAAGGCG GGCAGATCAG CTGAGGTTAG GAGTTTGaga ccggcctggc 
caacatggtg aaaccccgtc tctactagaa atacaaaaat tagctggggg tggtggtgtg 
tgcctgtaat ccagctgctc ■ 

"3.2 kb— 

caataccatt ttgttgccag ttatatagtt ctcctaaaaa taatgccACT ATTTTATTGA 060 
TATGTAGTTT TATTAGTAAA TAAGTATATC TGTTCTATAA CTATAAACTT ATTGATTGTG 120 
AATACATATT TTCTTAAAGA TTTAAGTAAA ATGTAATTTC TTATAAACCA CAGTCTTATT 180 
TGAGGGAATG TAGAGACAAG TGGGAGGCAG TGTATTTGAA GATACATTTA ACTTGGGAGA 2A0 
TTGAAAACAT TTCATTTTTT CTTTTTTTCT CCCTTCATTG CTTAACACAT TTTCCTATTT 300 
TTATCCCCTC TAGGACTGTT ATGAACACTA TCCAACAATT AATGATGATT TTAAATTCAG 360 

gThrVal MetAsnThrl leGlnGlnLft xiMetMetlle LeuAsnSerA (392) 
CAAGTGATCA ACCTTCAGAA AATCTGATTT CCTATTTTAA CGTAAGCCAT ATATGAAACA A20 
laSerAspGl nProSerGlu AsnLauIlcS crTyrPheAs n ^ (AOS) 

TTATTTATTG TAATATCTTG GCAAAGAAAC TTGAAATTAA AAGTT AAAG T ACTGAGT'fCT ABO 
TTTTAAAATA CTAATCTCCT ATCTAACATG TAGTTATCCA TAATCTTTTC TTGCTTTTTT SAO 
AATCTTACAA ATTATATATT ATTAGTAGTA TTGTTTTATT TATACAGTGT TATTTAAAAG 600 
ATTTTTATGT TTACCTATTT GCCTTgctca ccattcttcc ttcgaactta tgcctcaQtt 660 
ctgagataat tttttcttct tcagatatat cctttgataa ttac 70A 

-3*1 kb-- 

aaaatttaga taatagggtt ttttagttgt actgta^tat tttttgctcg attaacatcc 060 
AGTGAAATGA TATTGTCTGC TTATGTTCAG TAGTTGTGGT TACCTAGTTA.TTATGGAAGT 120 
GTTTCCACAT TTTTATGAAC AATTTAAAAA QTCATATATT ATCfGAGCAGA AAATATTAAT 180 
TCTGATTACA CAGTATCCTG GACATTGATT TCTGTTTTTA CCTCCTAAAG AACTGCAGAG 2A0 

AsnCysThrV (A09) 

TGAATCCAAA AGAAAGTATA CTGAAAAGAG TGAAGGATAT AGGATACATC TTTAAAGAGA 300 
alAsnProLy sGluSarlle LeuLysArgV alLysAspll eGlyTyrlle PheLysGluL (A29) 
AATTTGCTAA AGCTGTGGGA CAGGGTTGTG TCGAAATTGG ATCACAGGTA ACTT GAATTC 360 
ysPheAlaLy sAlaValGly GlnGlyCysV alGluIleGl ySerGln (AAA) 
ATTGTAATTC GTGGTACTAT AGAGTAATAA TATTAAAAGC AGCATCTTTC CAGTTCGTAT A 20 
AAATACTCTA ACAGTATTTG TCTAGTAGTA TAAAATACTG TCAGATACTA TATCCCTGCT A80 
GCCTGTGTAT GCTGCTATTT ATGGGAACTT TATGGAAAAC TACCTCCCAc cccattataa SAO 
aaactatgta ataaaggaac acatagccat tgtagaaatt ttng 58A 

—1.8 kb" 



Fig. 6-5 



EXON 14



EXON 15 
EXON 16 



EXON 17 



gaatgttaat 
gtcttggcgg 
AAAATTTTAG 
GAGGCCAAAG 
GTGAGACTCC 
GCTCTTATTT 

CGAGTAATGG 
ArgValHetG 
CGGGCGCGGT 
GGAGGfCAAG 
TACAAAAATT 
GGCAGGAGAA 
TCACCCCAGG 
ATTCAATGCT 
TCTGTTTCAG 

GTAAAAAATT 
AGCAAACTTC 
rLysLeuL 
GTAATGGCCA 
ValMetAlaT 
AAGTTAAAAT 
AGGAATGCTT 
TAAGAGGCTT 
ttctgagtat 
cctttgagat 

ggtatttaaa 
gaTTTGCATA 
AAAATTCTGC 
ACTTCCAAAA 
CATTTTTAGG 
r 

TTCTGAATGT 
leLeuAsnVa 
AAGCAGAAGG 
ysAlaGluGl 
TCATGGAATC 
leHetGlySe 
GTGCATATGG 
AACAAATAGG 
atattagaat 
ttaagaatag 



caccacttaa 
ccatatttgt 

taattgtcag 
caggaggatc 
atctcaaaaa 
ttctttttgt 

aatccatgct 

luSarMetLe 

GGCTCACGCC 

GCATCAAGAT 

AGCTGGGCGT 

TCCCTTGAAC 

CTGGCAACAG 

GAGACAAATA 

GAAGAAGAAC 

GluGluGluA 

TTTTTCTTTT 

TGAATGACAA 

euAsnAspAs 

CATATAGCAG 

hrTyrSerA 

GTGGTGTGTT 

ATTTTAGATC 

ATTTGAGTTA 

actggtatac 

tgtagagtgc 

tctttgaaaa 
CACTCAAAAT 
TCTAAATAAA 
AAATACCTAG 
AAGTACATCT 
gSerThrSer 
GCTTAATTTA 
ILeuAsnLcu 
CAACTTGACA 
yAsnLeiiThr 
CCTTGCATGG 
rLeuAlaTrp 
CTAACAAATT 
GAATTTAATG 
ggagctttaa 
cttttcttaa 



tacttaagtt 
aagaagggtg 
CTGGGTATAG 
TCTTGAGCCC 
AAAAAAAAAA 
TTGTTTGTAG 

TAAATCAGTA 

uLysSer 

TGCAATCCCA 

CATCCTGGCC 

GGTGGTGTAG 

CACGGAGGTG 

AGCAAGACAC 

AGGTTTCAAT 

GATTATCCAT 

rgLeuSerll 

TATAGAAGTA 

CATTTTTCAT 

nllePheHis 

TAAGTTAAAT 



gtgagtttta 
AGAAGTATGT 
TGGTACATGC 
AGGAGTGTGA 
TTTCATAATT 
CGATACAAAC 
ArgTyrLysL 
AGTTAAAAAC 

GCACTTTGGG 
AAAATGGTGA 
ACCTGTAGTC 
GAGGTTGCAG 
CATCTAAAAA 
TAAACAACTT 
TCAAAATTTT 
eGlnAsnPhe 
AGTATTTTAT 
ATGTCTTTAT 
MetSerLftuL 
TTTCATAAAT 



gacaagctng 
TTTAAGAAAA 
CTATAATCCC 
AGGCCAGCCT 
GTGATTTTCT 
TTGGAGTTCG 
euGlyValAr 
AATAATAAAA 

AGGCCGAGGT 
AACCCTGTCT 
CCAGCTACTT 
TGAGCCAAGA 
A AAAAAAAAA 
CTTTTTTTTT 
AGGTAAATTT 
Se 

AATCTTTTTT 
TGGC6TGCGC 
etiAlaCysAl 
AAACACTTTT 



cttttgtgtt 
GGCTTTTTAA 
AGCCTCTTGG 
GGGCAAAACA 
AAAATAGCAG 
CTTGTATTAC 
gLeuTyrXyr 
AAATTTCANC 

GGGCAGATCA 
CTACTAAAAG 
GGGAGGCTGA 
TTGTGCCATT 
AAAAAAAAAA 
TTTTAAATTA 
TTTACTTTTA 

TTTTTCCTTT 
TCTTGAGGTT 
aLeuGluVal 
GTTCAATTTA 



TCTTTGGTCG GGGGAGAGGG ATAGTGTGAG GTTAAGGAGA 
ACTATATACT. GAAGAATGTA ATTGGTCATT ATAAGCCATT 
TTTGAggcca tcttggggat aatatttcac taggcttctc 
tgaatccaaa aaaggtactt tttcgaaatc cctccgaaga 



tttgagatca 

TGGAAGGCTA 

AATGGTTTAA 

CTCAAGGGTT 

CAGAATCTTG 

GlnAsnLauA 

AAAGCCTTTG 

LysAlaPheA 

AGAGAAATGA 

ArgGluMfttl 

CTCTCAGTAA 

LeuSer 

ATTGTTAGTG 

AATAATGTTA 

tcgagcaccc 

aacatgccac 



•1.0 kb— 
gctataagtc 
TTTCCTATGA 
CCTTTCTACT 
AATATTTCAT 
ATTCTGGA4C 
flpSerGlyTh 
ATTTTTACAA 
spPheTyrLy 
TAAAACATTT 
leLysHisLe 
GTAGCTAAAT 



ctttctctag 
GTCCGTAGAC 
GTTTTGTTTG 
AAATAGTTAC 
A6ATTTGTCT 
rAspLeuSer 
AGTGATCGAA 
sVallleGlu 
AGAACGAT6T 
uGluArgCys 
AATTGAAGAA 



gaaaaacaca 
TCCAAAATAA 
TCTGATAATA 
TTTTTTTTTT 
TTCCCATGGA 
PheProTrpI 
AGTTTTATCA 
SerPhelleL 
GAACATCGAA 
GltiHisArgl 
ATTCATTCAT 



AGAGGTGTTT CTTAACAAAT CTACCTCAAG 
TTTCAGTCTA TAGCCCAAGG ATCAagtgga 
taaaccatct aatacagcnc agtgatttat 
ttt 

70*0 kb— 



060 
120 
180 
2A0 
300 
360 
(ASA) 
A20 
(A63) 
A80 
SAO 
600 
660 
720 
780 
SAO 
(A7A) 
900 
960 
(A93) 
1020 
(500) 
1080 
llAO 
1200 
1260 
1280 

060 
120 
180 
2A0 
300 

(517) 
360 

(537) 
A20 

(557) 
A80 

(565) 
SAO 
600 
660 
693 



Fig. 6-6 



attttctaat ataagcgttg aaggtta^tac atttttctac ttttttgtgt gtgggaagta 060 

caaaaattgt CAATTGGGAA TTTCGMGTA GAGAAAAATA TTTCATTCTG ACTTTTAAXT. 120 

TGCCACTGTC AATTGTGCCT AAAATTCATA GTACTTACCA TGTCAAACAA TATGATTTTG 180 

ATATGTACCT GGGAAAATTA TGCTTACTAA TGTGGTTTTA ATTTCATCAT GTTTCATATA 2A0 

EXON 18 GGATTCACCT TTATTTGATC TTATTAAACA ATCAAAGGAC CGAGAAGGAC CAACTGATCA 300 

AspSerPro LeuPheAspL euIleLysGl nSerLysAsp ArgGluGlyP roThrAspHi (585) 

CCTTGAATCT GCTTGTCGTC TTAATCTTCC TCTCCAGAAT AATCACACTG CAGCAGATAT 360 

sLeuGluSer AlaCysProL euAsnLeuPr oLeuGlnAsn AsnHisThrA laAlaAspMe (605) 

GTAAGGAAAA TATATGTTAT GTTGACCATC AAAGTGCAAA TAGATTTTAA GCATAAGTGC A20 

AATGTAACAT Tf-^ATAAAGA V ''^GTAGGG AATAGAATTT TGAATAAGA.* TAGTTTCTGT A80 

TTTTAAGAAA TTAGTAATAA AAUGTACATg acccaaataa agtcatataa aagagtacag SAO 

agtgctactg aatcacctag gatttgcata atgagagcag ttttcatgg 589 

—3*0 kb" 

tgtttttaag ctggaatcac cttatggtct caataccact ataattatta aaattgTACA 060 

TTATACATAT ATAGCTATTT TTTTCTAATA AGGCAGTAAT CCCCAGGAAA AGCCATTTAT 120 

TAAAATAGAA TTAGATATGA TGATGACAAG CAGTTTTCCT ATTAATATAT CTTTCCCAGC 180 

TTGCATTTAA ATAGTCTGCT ATAATACCAA TTAAATAGAC AAGATGTATC TGGGTGTACA 2 AO 

ACCTTGAAGT GTATGTATAA TCTGTGATTC TTAGCCAACT T6AAATGAAG ACTTTTCCTT 300 

EXON 19 TAAATATATC TAGGTATCTT TCTCCTGTAA GATCTCCAAA GAAAAAAGGT TCAACTACGC 360 

tTyrLeu SerProValA rgSerProLy sLysLysGly SerThrThrA (621) 

GTGTAAATTC TACTGCAAAT GCAGAGACAC AAGCAACCTC AGCCTTCCAG ACCCAGAAGC A20 

rgValAsnSe rThrAlaAsn AlaGluThrG InAlaThrSe rAlaPhcGln ThrGlnLysP (6A1) 

CATTGAAATC TACCTCTCTT TCACTGTTTT ATAAAAAAGG TTA6TAGATG ATTATTTTCA A80 

roLeuLysSe rThrSerLau SerLetiPheT yrLysLysV (65A) 

AGAGCATGGA CTCTGAAACT AGGGTGACTG GGTTCAAATC ATGTTTCTTC TACTTTCTAb SAO 

GTAGATTACT GGGCAAGTCA CTTAATATCT CTGTGTCTCA GTTTCCTcat ctataaaatg 600 

gaaatgataa tgttgcgaga tctttcttga ctattcagag tcgttttctg 650 

aaggaaaatc catgccccnt cgggacatgc ctgncctctg catttcttca tctgtatcc^: 060 

ttgtaatatg cctcataata aaccagtaaa catGTTTCTC TGGGGGAAAG AAAAGAGTGG 120 

TAGAAAAGAG GTTTCTGTTA AAATGCTACT TAACAGCATT ATAATTAGTG TAATTTCATG 180 

ATTTGAAAAA AATCTACTTG TAATTCAAAA TGAACAGtHL' AAATGACTAA TTTTTCTTAT ' 2A0 

EXON 20 TCCCACAGTG TATCGGCTAG CCTATCTCCG GCTAAATACA CTTTGTGAAC GCCTTCTGTC 300 

al TyrArgLeuA laTyrLeuAr gLeuAsnThr LeuCysGlxiA rgLeuLauSe (671) 

TGAGCACCCA GAATTAGAAC ATATCATCTG GACCCTTTTC CAGCACACCC TGCAGAATGA 360 

rGluHisPro GluLeuGluH isIlellaTr pThrLeuPhe GlnHisThrL euGlnAsnGl (691) 

GTATGAACTC ATGAGAGACA GGCATTTGGA CCAAGTAAGA AAATCAAGCA CTTCACCTTC A20 

uTyrGluLeu HetArgAspA rgHlsLeuAs pGln (702) 

TCTCCTCCCT ACTTACTTGT TAACTGATTT CTTTCTTTCT TTCTTTCTTT CTTTCTTTCT A 80 
TTCTTTCTTT CTTTCTTTCT TTCTTTTCTT TCTTTTCTTT CTTTCTTTCT TTCCTTTTTT SAO 

TTTTTTTGAG ATAGAGTCTC ACTCTGTTAC CCAGGCTGGA GTGCAGTGGC GCAATCTCGG 600 

CTCACTGCAA CCTCCGCCTC CCAGGTCAAG TGATTCTCCT GCCTCAGCCT ccnaggagct 660 

aggatacagg cgtgtaccac cacaccttgt taatttttgt tatttagtag agacagg 717

EXON 23



EXON 24
HindIII



caagagccaa 
ggtgatttgC 
AACT GAGCTC 
AAAGAAAGAA 
ACATCAATTT 

TATAGACCTT 

nlleAspLeu 

GGAGGTAGGT 

nGlu ^' 

AGGTAATTCA 

ATGTAAATTC 

AGATAAGGGT 

TTTTATCCCT 

atatcttgag 

tccatctgct 
gagcaGCTAT 
ATATGTGCTT 
AGAAATTTTA 
ACTGTTCTTC 

ATAGTATTCT 

IleValPheT 

TCCACCAGGG 

SerThrArg 

AAAACATTAA 

ATACCTCACA 
IleProHisI 
GGGAACATCT 
GlyAsnlleT 
CCAACAAAAA 
ProThrLysM 
TGAGAGGGGG 
AGCTAGACTC 
GGTGAAGAAA 
gtctacagct 
taccgggttc 

ttgataactt 
tgccctccct 
TAAAACTAAG 
TGAATATAGT 
GTTCAGAATG 
TTGGTATTTC 

GTATTTTCTT 
TCTATAATTT 
AATTTTATGA 
ACTCCTAATT 
ATATACATTT 
ATCACCAAAT 
ctacaattea 



agttagggta 
ATTTTGTTCT 
AGTATGGAAA 
AATGGTATTT 
ATTTACTAGA 
I 

AAATTCAAAA 
LysPheLysI 
AATTTTCCAT 



atttacaaac 
TTAAACACAC 
GAAATAACTC 
TTTAAGAACA 
TTATGATGTG 
leMetMetCy 
TCATTGTAAC 
lelleValTh 
AGTAAGTTTT 



caggtgatca 
TTTGGGTTAA 
TGTAGATTAA 
AAACCATGTA 
TTCCATGTAT 
sSerMetTyr 
AGCATAGAAG 
rAlaTyrLys 
TTTGATAAAT 



gtcctggata 
ACACTTCATG 
ACCTTTCTTT 
ATAAAATTCT 
GGCATATGCA 
GlylleCysL 
GATCTTCCTC 
AspLeuProH 
CCATATCCAT 



attgagcctt 
TAGACTTTCA 
TTTGAGGCTA 
GACTACTTTT 
AAGTGAAGAA 
ysValLysAs 
ATGCTGTTCA 
isAlaValGl 
AACATAACAT 



TTTGATCTCA TTTATCATTA ATGAGATCAT ATATTCTGTC TGACCTTATT 
ACAAATAAAA ACTTTTATAT TATTTATTTG TAACTTAAAT AGAATTGGAA 
AATTATGAAA TTACCCATAT CATAGTTTTT TATAAAGTTA ATAAATAATA 
GTAATAAGCA GGTATTTGTa ataaacttga catgagtcat agaacattag 



gctgcctggc 
AATCCAAGCC 
CTTACCAGTG 
AAATTCATTT 
CTCAGACATT 
TKrPh 
ATAACTCGGT 
yrAsnSerVa 
TAGGTCAAAA 



tatttctctc 
TAAGAAGTAA 
AAAAAGTATT 
AACAAGTAAA 
CAAACGTGTT 
eLysArgVal 
CTTCATGCAG 
IPheHetGln 
GTATCCTTTG 



'0.2 kb— 

aatcgattct 

TTTTATTTAT 

ATAA AC TATT 

TTTTACTTTT 

TTGATCAAAG 

LeulleLysG 

AGACTGAAAA 

ArgLeuLysT 

ATTGGAAAAA 



ATAAATAATC TACTTTTTTG TTTTTGCTCT 



TTCCTCGAAG 
leProArgSe 
ATATTTCACC 
yrIleS«Pr 
TGACTCCAAG 
etThrProAr 
ATTATTTTGA 
TTGAAACTCT 
ATCTATTTAC 
cccagcgtga 
atctcactag 

acccattgat 
aaatatgggc 
AGACTAGGTG 
TTGTCAGTGG 
ATGTATTTAT 
ATCTTAACTT 

TCTATGAAAT 
GAATTTCCAA 
GATATATATA 
GGTGAACTTG 
GTGTATATAC 
GTGTGAGATT 
atteaattct 



CCCTTACAAG 

rProTyrLys 

CCTGAAGAGT 

oLeuLysSer 

ATCAAGGTGT 

gSerAr 

TCCAAGAATA 

ATTTGCTTAT 

GACTTAAGCA 

gcacgcagaa 

ggagtgccag 

ttatgaagaa 
aatggcagna 
AGTATATGAT 
TTCTAGGGTA 
GCTCATCTCT 
GAGAGAATCT 
gIleL 
ATAATAGTAT 
ATGCAGTTAT 
TCTCAGATTT 
AAAACTTTTC 
ATACATATAT 
TTTTTCTCAC 
gacactaatt 



TTTCCTAGTT 
PheProSerS 
CCATATAAAA 
ProTyrLysI 
GTGTTTTGTC 

AAAAATATAA 
TTAAGTAACA 
ACGCAAGATG 
gacgggtgat 
acagt 
-7.4 kb- 
ctaagtaggg 
tatgttcttg 
TAGACGGGCA 
GAGGTAACCT 
GCAAAATTGT 
TAGTATCAAT 
euValSerIl 
GCATTGTAAG 
TCAAACACCT 
ACTTTCAAAT 
TAAATATCCA 
ACGTGAGCTG 
CCAAATCTAT 
Acccagag 
-2.8 kb- 



gtgacatttc 
TTATTATTTT 
AGAAAAGAAA 

AAGAGGAGTA 
luGluGluTy 
CAAATATTTT 
hrAsnIleLe 
TCTAATGTAA 

AGCCCCCTAC 
ProProTh 
CACCCTTACG 
erProLeuAr 
TTTCAGAAGG 
leSerGluGl 
TTTAGGGAAG 



acttctagaa 
TTCCTTTATA 
ATGTAAAGGT 

TGATTCTATT 
rAspSerIle 
GCAGTATGCT 
uGlnTyrAla 
TGGGTCCACC 

CTTGTCACCA 
rLeuSerPro 
GATTCCTGGA 
gIleProGly 
TCTGCCAACA 
yLeuProThr 
TAGTAAAGAA 



AGCATTCTTC ATTTCAAATA 
TAATAAGAAT ATGGGGGGGG 
GCCGAATAGG AACAGCTCCg 
ttctgcattt ccatctgagg 



gtaaccttga 
cagacctata 
CTGTTAGAAT 
TTAATTTGGT 
ATATGGTTTT 
TGGTGAATCA 
eGlyGluSer 
TATAAAAGAA 
CATCCAGGCA 
CAAGTTTAAT 
CTTGAGATTA 
TTTTTGCTCA 
TCTTcaactc 



aacttgcctt 

acttttgcTT 

AATTCCCAAA 

ATTCCTAATA 

TTATTACTAA 

TTCGGGGTGA 

PheGly 

ATTAAAGCTT 

TATTGCATAG 

CTCAAATCAT 

TATAATACAT 

CAACATTTCT 

tctggtgctt 



060 

120 

180 

240 

300 
(716) 

360 
(736) 

420 
(737) 

480 

540 

600 

660 

670 

060 
120 
180 
240 
300 

(752) 
360 

(772) 
420 

(775) 
480 

(781) 
540 

(801) 
600 

(821) 
660 

(830) 
720 
780 
840 
900 
935 

060 
120 
180 
240 
300 
360 
(840) 
420 
480 
540 
600 
660 
720 
758 
ggggatggaa ttaggtagtt attctgattt ttAGATTTTT CATATCTTTT ATTTGGTCCA 060 
ATGAAGCAGA AAATTTAAAT GAAGTTATTA CCTTTGCCTG ATTTTTGACA CACCTCAAAC 120 
TATAACTTGA GGTTGCTAAC TATGAAACAC TGGCATTTAA TGATTTAAAG TAAAGAATTC 180 
TGTAATTTGT AGACTTCTGA GAAGTTCCAG AAAATAAATC AGATGGTATG TAACAGCGAC 240 
ThrSerGl uLysPheGln LysIleAsnG lnMetValCy sAsnSerAsp (856) 
CGTGTGCTGA AAAGAAGTGC TGAAGGAAGC AACCCTCCTA AACCACTGAA AAAACTACGC 300 
ArgValLeuL ysArgSerAl aGluGlySer AsnProProL ysProLeuLy sLysLeuArg (876) 
TTTGATATTG AAGGATCAGA TGAAGCAGAT GGAAGGTAGG AACCAGTTTT GAATGTTTTC 360 
PheAspIleG luGlySerAs pGluAlaAsp GlySe (888) 
CAGTAGCTGA GATGGTCATC TGGGGAATCC AGAGTCTCAG CACTGCTCCT GGCTTATACC 420 
AATTTCa^TTC ATGCCAAGTT TATTTGGAAG TTGTGAGAAT GGCTCAAAAT AATAGATATG 480 
AGTGTAGTGC AAAGTTAAAA ACATCTTACA AATTGCATAC CAACATTCAG TGAAGATATC 540 
TAATAAAGCC TGATCTTTTT TACAAAGCTA TTGATAAAAT TTTGTTATTC TTAACATTAA 600 
ATTTAAAAAT GTTTACTCTT GAAAAATATT AACCACTGTA TTTTGTGAGA ACCACTGAAA 660 
AAATACATAG CATCATAAAT TTGTGACATT TATGTTTTAG ATGGTTAGTT TTTAAATTTT 720 
AAAATTAAAA GCTACTCACT AAAATAATAG CATAAAGTAA GTCATCGAAA GCATCATAGT 780 
TACTGGAAAT TTGAGTTTTC CATTTATAAA TACACATGAA ATGTTTTGCA TTTTTTTAAT 840 
CTGCAGTAAA CATCTCCCAG GAGAGTCCAA ATTTCAGGAG AAACTGGCAG AAATGAGTAA 900 
rLys HisLeuProG lyGluSerLy sPheGlnGln LysLeuAlaG luMetT (905) 
GTACTTTTTT CACCTTGTGT AAACGAAATA AACAATTGTT TACACTGCAA GAAGTCTTTT 960 
CGTTATATAA AAGAATGTAT AATTTCTTCA GTTGGCAGGT TTGTTTATGC ATTTAAAATA 1020 
TAATTCAATC AAGGTTATTT ATCfACAAAC ATTTGTGQAT TAAATGTATG ATGTAAAATG 1080 
AAGGTCATTT TTACCCTTTC TATGATCTTT CATGCAGGAA GACIAAGAAG TGAAACATTG 1140 
CTTGACCACA TTCAAcacaa atggctacag ttagaaaata ctttagcaga actacaaaga 1200 
ggaactattt gggagtgtta gatataggga aaagttttat aaacctagca tatgtaaaac 1260 
atcatcaccc ttatttaagg aataaccttt gattctaccg atttttaaac 1310 

-1.7 kb- 
tctagctatt tgaatatgca gtaaattaac tgtaactcct acggtactgt caaatactag 060 
ffm!^^^^'^*' acctcttttt goAAGGTCCT GAGCGCCATC AGTTTGACAT GAGCATAATA 120 
TATATGGCAG CCACTTGCCA ACTTACCCAG TACGATCAAT GCTGTTAACA GTTCTTCATC 180 
CTTTTTCCAG CTTCTACTCG AACACGAATG CAAAAGCAGA AAATGAATGA TAGCATGGAT 240 
hrSerThrAr gThrArgMet GlnLysGlnL ysMetAsnAs pSerMetAsp (921) 
ACCTCAAACA AGGAAGAGAA ATGAGGATCT CAGGACCTTG GTGGACACTG TGTACACCTC 300 
ThrSerAsnL ysGluGluLy sEND 3' untranslated region begins here (928) 
TGGATTCATT GTCTCTCACA GATGTGACTG TATAACTTTC CCAGGTTCTG TTTATGGCCA 360 
CATTTAATAT CTTCAGCTCT TTTTGTGGAT ATAAAATGTG Cy^GATGCAAT TGTTTGGGTG 420 
ATTCCTAAGC CACTTGAAAT GTTAGTCATT GTTATTTATA CAAOATTGAA AATCTTGTGT 480 
AAATCCTGCC ATTTAAAAAG TTGTAGCAGA TTGTTTCCTC TTCCAAAGTA AAATTGCTGT 540 
GCTTTATGGA TAGTAAGAAT GGCCCTAGAG TGGGAGTCCT GATAACCCAG GCCTGTCTGA 600 
CTACTTTGCC TTCTTTTGTA GCATATAGGT GATGTTTGCT CTTGTTTTTA TTAATTTATA 660 
TGTATATTTT TTTAATTTAA CATGAACACC CTTAGAAAAT GTGTCCTATC TATCTTCCAA 720 
ATGCAATTTG ATTGACTGCC CATTCACCAA AATTATCCTG AACTCTTCTG CAAAAATGGA 780 
TATTATTAGA AATTAGAAAA AAATTACTAA TTTTACACAT TAGATTTTAT TTTACTATTG 840 
GAATCTGATA TACTGTGTGC TTGTTTTATA AAATTTTGCT TTTAATTAAA TAAAAGCTGG 900 
AAGCAAAGTA TAACCATATG ATACTATCAT ACTACTGAAA CAGATTTCAT ACCTCAGAAT 960 
GTAAAAGAAG TTACTGATTA TTTTGTTCAT CCAACTTATG TTTTTAAATG AGGATTATTG 1020 
ATAGTACTCT TGGTTTTTAT ACCATTCAGA TCACTGAATT TATAAAGTAC CCATCTAGTA 1080 
CTTGAAAAAG TAAAGTGTTC TGCCAGATCT TAGGTATAGA GGACCCTAAC ACAGTATATC 1140 
CCAAGTGCAC TTTCTAATGT TTCTGGGTCC TGAAGAATTA AGATACAAAT TAATTTrACT 1200 
CCATAAACAG ACTGTTAATT ATAGGAGCCT TAATTTTTTT TTCATAGAGA TTTGTCTAAT 1260 
TGCATCTCAA AATTATTCTG CCCTCCTTAA TTTGGGAAGG TTTGTGTTTT CTCTGGAATOi 1320 
GTACATGTCT TCCATGTATC TTTTOAACTG GCAATTGTCT ATTTATCTTT TATTTTTTTA 1380 
AGTCAGTATG GTCTAACACT GGCATGTTCA AAGCCACATT ATTTCTAGTC CAAAATTACa 1440 
AGTAATCAAG GGTCATTATG GGTTAGGCAT TAATGTTTCT ATCTGATTTT GTGCAAAAGC 1500 
EXON 27 (CONTD) 3' untranslated region (continued) 

mAAATTAA AACAGCTGCA TTAGAAAAAG AGGCGCTTCT CCCCTCCCCT ACACCTAAAG 1560 
GTGTATTTAA ACTATCTTGT GTGATTAACT TATTTAGAGA TGCTGTAACT TAAAATAGGG 1620 
GATATTTAAG GTAGCTTCAG CTAGCTTTTA GGAAAATCAC TTTGTCTAAC TCAGAATTAT 1680 
TTTTAAAAAG AAATCTGGTC TTGTTAGAAA ACAAAATTTT ATTTTGTGCT CATTTAAGTT 1740 
TCAAACTTAC TATTTTGACA GTTATTTTGA TAACAATGAC ACTAGAAAAC TTGACTCCAT 1800 
TTCATCATTG TTTCTGCATG AATATCATAC AAATCAGTTA GTTTTTAGGT CAAGGGCTTA 1860 
CTATTTCTGG GTCTTTTGCf ACTAAGTTCA CATTAGAATT AGTGCCAGAA TTTTAGGAAC 1920 
TTCAGAGATC GTGTATTGAG ATTTCTTAAA TAATGCTTCA GATATTATTG CTTTATTGCT 1980 
TTTTTGTATT GGTTAAAACT GTACATTTAA AATTGCTATG TTACTATTTT CTACAATTAA 2040 
TAGTTTGTCT ATTTTAAAAT AAATTAGTTG TTAAGAGTC^TTAATGGTCTG ATGTTGTGTT 2100 

polyadenylation signal sequence *polyadenylation site 
CTTTGTATTA AGTACACTAA TGTTCTCTTT TCTGTCTAGG AGAAGATAGA TAGAAGATAA 2160 
CTCTCCTAGT ATCTCATCCA TTCCTAGCCT TTAAGGGGCT CTATATGCTA GAGATTTCCA 2220 
AATTTATTTC TTCAGCCCTG ATCTTTICAC AGAGGTCAAG Gcttttatag ccaacagaac 2280 
tcttgattcc tactcccntc tacccaatgt ctccaaatat aaactaaaat caaataaata 2340 
aaaatctttt tt 2352 
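
The amino acid annotations printed beneath the coding rows above can be checked against the nucleotide rows with a short script. The following is a minimal sketch, assuming the standard genetic code and the three-letter residue names used in the listing; the names CODON_TABLE and translate are illustrative and are not part of the original disclosure. It reads a fragment in frame from its first nucleotide and writes "END" at the first stop codon, as the listing does.

```python
# Standard genetic code in TCAG order, using the three-letter names of the listing.
# "END" marks a stop codon, matching the annotation under the last coding row.
BASES = "TCAG"
AMINO = (
    "Phe Phe Leu Leu Ser Ser Ser Ser Tyr Tyr END END Cys Cys END Trp "
    "Leu Leu Leu Leu Pro Pro Pro Pro His His Gln Gln Arg Arg Arg Arg "
    "Ile Ile Ile Met Thr Thr Thr Thr Asn Asn Lys Lys Ser Ser Arg Arg "
    "Val Val Val Val Ala Ala Ala Ala Asp Asp Glu Glu Gly Gly Gly Gly"
).split()

# Hypothetical helper table: codon -> residue name.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA fragment; spaces between 10-mers are ignored."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        residues.append(residue)
        if residue == "END":  # stop at the first termination codon
            break
    return "".join(residues)


# Last coding codons of exon 27, copied from the row numbered 300 above.
print(translate("ACCTCAAACA AGGAAGAGAA ATGA"))
```

Run on the fragment shown, the sketch prints ThrSerAsnLysGluGluLysEND, which agrees with the annotation ending at residue 928. Rows that begin or end inside a codon (at an exon boundary) need the neighbouring nucleotides supplied before they can be checked this way.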